SYNTHETIC MAMMALIAN α-N-ACETYLGLUCOSAMINIDASE AND

GENETIC SEQUENCES ENCODING SAME 

FIELD OF THE INVENTION 

The present invention relates generally to mammalian α-N-acetylglucosaminidase and to
genetic sequences encoding same and to the use of these in the investigation, diagnosis and
treatment of subjects suspected of or suffering from α-N-acetylglucosaminidase deficiency.

Bibliographic details of the publications referred to by author in this specification are 
collected at the end of the description. Sequence Identity Numbers (SEQ ID NOs.) for the 
nucleotide and amino acid sequences referred to in the specification are defined following
the bibliography. 

Throughout this specification, unless the context requires otherwise, the word "comprise",
or variations such as "comprises" or "comprising", will be understood to imply the
inclusion of a stated element or integer or group of elements or integers but not the 
exclusion of any other element or integer or group of elements or integers. 

BACKGROUND TO THE INVENTION 

The increasing sophistication of recombinant DNA technology is greatly facilitating the 
efficacy of many commercially important industries including areas of medical and 
pharmaceutical research and development. The ability to purify native proteins and 
subsequently clone genetic sequences encoding these proteins is an important first step in 
the development of a range of therapeutic and diagnostic procedures. However,
practitioners have faced many difficulties in purifying target molecules to an extent
sufficient to determine amino acid sequences to permit the development of oligonucleotide
probes to assist in the cloning of genetic sequences encoding the target molecules. 
Such difficulties have been particularly faced in the research and development of lysosomal
enzymes. An important lysosomal enzyme is α-N-acetylglucosaminidase (EC 3.2.1.50).
This enzyme acts as an exoglycosidase in lysosomes to hydrolyse the terminal
α-N-acetylglucosamine residues present at the non-reducing terminus of fragments of heparan
sulphate and heparin (Hopwood, 1989). A deficiency in this lysosomal hydrolase is
responsible for the pathogenesis of Sanfilippo B (Mucopolysaccharidosis type IIIB
[MPS-IIIB]) syndrome (von Figura and Kresse, 1972; O'Brien, 1972). This is an autosomal
recessive disorder of glycosaminoglycan catabolism leading to storage and excretion of
excessive amounts of heparan sulphate and a variety of clinical phenotypes, but classically
presenting with progressive mental retardation in conjunction with skeletal deformities
(McKusick and Neufeld, 1983).

There is a need, therefore, to purify α-N-acetylglucosaminidase and to clone genetic
sequences encoding same to permit development of a range of therapeutic and diagnostic
procedures to assist in the diagnosis and treatment of disease conditions arising from
α-N-acetylglucosaminidase deficiency.

SUMMARY OF THE INVENTION

One aspect of the invention provides an isolated nucleic acid molecule comprising a
sequence of nucleotides which encodes or is complementary to a sequence which encodes
a mammalian α-N-acetylglucosaminidase or fragment or derivative thereof.

A second aspect of the invention provides an isolated nucleic acid molecule comprising a
sequence of nucleotides which is capable of hybridising under at least low stringency
conditions to a nucleotide sequence set forth in SEQ ID NO:1 or SEQ ID NO:3 or a
complementary strand or a homologue, analogue or derivative thereof.

Another aspect of the invention is directed to an isolated nucleic acid molecule which is at
least 40% identical to the nucleotide sequence set forth in SEQ ID NO:1 or SEQ ID
NO:3 or to a complementary strand thereof or a homologue, analogue or derivative
thereof.

A further aspect of the present invention provides a nucleic acid molecule comprising a
sequence of nucleotides encoding or complementary to a sequence encoding a polypeptide
capable of hydrolysing the terminal α-N-acetylglucosamine residues present at the
non-reducing terminus of fragments of heparan sulphate and heparin residues and wherein said
nucleotide sequence is capable of hybridising under low stringency conditions to the
nucleotide sequence set forth in SEQ ID NO:1.

A further aspect of the invention is directed to a genetic construct comprising a sense
molecule, for the expression or over-expression of α-N-acetylglucosaminidase in
prokaryotic or eukaryotic cells.

A further aspect of the present invention is directed to synthetic α-N-acetylglucosaminidase
or like molecule.

A further aspect of the invention contemplates antibodies to α-N-acetylglucosaminidase
and preferably synthetic α-N-acetylglucosaminidase or a like molecule.

In still yet another aspect of the present invention there is contemplated a method of
diagnosing a mutation or other aberrations in the α-N-acetylglucosaminidase gene in a
human or animal patient.

Another aspect contemplates a method of treating patients suffering from
α-N-acetylglucosaminidase deficiency, such as in MPS-IIIB, said method comprising
administering to said patient an effective amount of α-N-acetylglucosaminidase or active
like form thereof.

Another aspect of the present invention is directed to a pharmaceutical composition
comprising a recombinant mammalian α-N-acetylglucosaminidase or an active fragment
or derivative thereof and one or more pharmaceutically acceptable carriers and/or diluents.



BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1 is a photographic representation of α-N-acetylglucosaminidase purified from
human placenta following SDS/PAGE. Lane 1: Mr standards (kDa); Lanes 2 and 3:
purified α-N-acetylglucosaminidase from human placenta; Lanes 4 and 5: bovine serum
albumin.

Figure 2 is a photographic representation of an SDS/polyacrylamide gel showing the
molecular weights of recombinant α-N-acetylglucosaminidase polypeptides produced in
CHO cells before (-) and after (+) PNGase F digestion. The 50 mM NaCl and 75 mM
NaCl fractions are indicated. Molecular weights of α-N-acetylglucosaminidase
polypeptides are indicated on the left of the figure. Molecular weights of marker proteins
are indicated on the right hand side of the figure (lane 5).

Single and three letter abbreviations of conventional amino acid residues as used herein are 
defined in Table 1.

Suitable amino acid substitutions referred to herein are defined in Table 2.

Codes for non-conventional amino acid residues as used herein are defined in Table 3. 
TABLE 1

Amino Acid          Three-letter      One-letter
                    Abbreviation      Symbol

Alanine             Ala               A
Arginine            Arg               R
Asparagine          Asn               N
Aspartic acid       Asp               D
Cysteine            Cys               C
Glutamine           Gln               Q
Glutamic acid       Glu               E
Glycine             Gly               G
Histidine           His               H
Isoleucine          Ile               I
Leucine             Leu               L
Lysine              Lys               K
Methionine          Met               M
Phenylalanine       Phe               F
Proline             Pro               P
Serine              Ser               S
Threonine           Thr               T
Tryptophan          Trp               W
Tyrosine            Tyr               Y
Valine              Val               V
Any residue         Xaa               X

TABLE 2

Suitable residues for amino acid substitutions

Original Residue    Exemplary Substitutions

Ala                 Ser
Arg                 Lys
Asn                 Gln; His
Asp                 Glu
Cys                 Ser
Gln                 Asn
Glu                 Asp
Gly                 Pro
His                 Asn; Gln
Ile                 Leu; Val
Leu                 Ile; Val
Lys                 Arg; Gln; Glu
Met                 Leu; Ile
Phe                 Met; Leu; Tyr
Ser                 Thr
Thr                 Ser
Trp                 Tyr
Tyr                 Trp; Phe
Val                 Ile; Leu

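As an illustration only, the substitution table can be held as a simple lookup. The sketch below is not part of the specification: the dictionary and function names are hypothetical, and the Ser entry is assumed from the usual form of this table. Note that the table is directional, so a listed substitution does not imply that its reverse is listed.

```python
# Exemplary substitutions keyed by original residue (three-letter codes).
# The "Ser" row is an assumption based on the usual form of this table.
EXEMPLARY_SUBSTITUTIONS = {
    "Ala": {"Ser"},
    "Arg": {"Lys"},
    "Asn": {"Gln", "His"},
    "Asp": {"Glu"},
    "Cys": {"Ser"},
    "Gln": {"Asn"},
    "Glu": {"Asp"},
    "Gly": {"Pro"},
    "His": {"Asn", "Gln"},
    "Ile": {"Leu", "Val"},
    "Leu": {"Ile", "Val"},
    "Lys": {"Arg", "Gln", "Glu"},
    "Met": {"Leu", "Ile"},
    "Phe": {"Met", "Leu", "Tyr"},
    "Ser": {"Thr"},
    "Thr": {"Ser"},
    "Trp": {"Tyr"},
    "Tyr": {"Trp", "Phe"},
    "Val": {"Ile", "Leu"},
}


def is_exemplary_substitution(original: str, replacement: str) -> bool:
    """Return True if `replacement` is listed for `original` in the table."""
    return replacement in EXEMPLARY_SUBSTITUTIONS.get(original, set())
```

For example, `is_exemplary_substitution("Lys", "Arg")` is true, whereas `is_exemplary_substitution("Pro", "Gly")` is false because Pro has no row of its own.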
TABLE 3

Non-conventional amino acid                           Code

α-aminobutyric acid                                   Abu
α-amino-α-methylbutyrate                              Mgabu
aminocyclopropane-carboxylate                         Cpro
aminoisobutyric acid                                  Aib
aminonorbornyl-carboxylate                            Norb
cyclohexylalanine                                     Chexa
cyclopentylalanine                                    Cpen
D-alanine                                             Dal
D-arginine                                            Darg
D-aspartic acid                                       Dasp
D-cysteine                                            Dcys
D-glutamine                                           Dgln
D-glutamic acid                                       Dglu
D-histidine                                           Dhis
D-isoleucine                                          Dile
D-leucine                                             Dleu
D-lysine                                              Dlys
D-methionine                                          Dmet
D-ornithine                                           Dorn
D-phenylalanine                                       Dphe
D-proline                                             Dpro
D-serine                                              Dser
D-threonine                                           Dthr
D-tryptophan                                          Dtrp
L-N-methylalanine                                     Nmala
L-N-methylarginine                                    Nmarg
L-N-methylasparagine                                  Nmasn
L-N-methylaspartic acid                               Nmasp
L-N-methylcysteine                                    Nmcys
L-N-methylglutamine                                   Nmgln
L-N-methylglutamic acid                               Nmglu
L-N-methylhistidine                                   Nmhis
L-N-methylisoleucine                                  Nmile
L-N-methylleucine                                     Nmleu
L-N-methyllysine                                      Nmlys
L-N-methylmethionine                                  Nmmet
L-N-methylnorleucine                                  Nmnle
L-N-methylnorvaline                                   Nmnva
L-N-methylornithine                                   Nmorn
L-N-methylphenylalanine                               Nmphe
L-N-methylproline                                     Nmpro
L-N-methylserine                                      Nmser
L-N-methylthreonine                                   Nmthr
L-N-methyltryptophan                                  Nmtrp
L-N-methyltyrosine                                    Nmtyr
L-N-methylvaline                                      Nmval
L-N-methylethylglycine                                Nmetg
L-N-methyl-t-butylglycine                             Nmtbug
L-norleucine                                          Nle
L-norvaline                                           Nva
D-tyrosine                                            Dtyr
D-valine                                              Dval
D-α-methylalanine                                     Dmala
D-α-methylarginine                                    Dmarg
D-α-methylasparagine                                  Dmasn
D-α-methylaspartate                                   Dmasp
D-α-methylcysteine                                    Dmcys
D-α-methylglutamine                                   Dmgln
D-α-methylhistidine                                   Dmhis
D-α-methylisoleucine                                  Dmile
D-α-methylleucine                                     Dmleu
D-α-methyllysine                                      Dmlys
D-α-methylmethionine                                  Dmmet
D-α-methylornithine                                   Dmorn
D-α-methylphenylalanine                               Dmphe
D-α-methylproline                                     Dmpro
D-α-methylserine                                      Dmser
D-α-methylthreonine                                   Dmthr
D-α-methyltryptophan                                  Dmtrp
D-α-methyltyrosine                                    Dmty
D-α-methylvaline                                      Dmval
D-N-methylalanine                                     Dnmala
D-N-methylarginine                                    Dnmarg
D-N-methylasparagine                                  Dnmasn
D-N-methylaspartate                                   Dnmasp
D-N-methylcysteine                                    Dnmcys
α-methyl-aminoisobutyrate                             Maib
α-methyl-γ-aminobutyrate                              Mgabu
α-methylcyclohexylalanine                             Mchexa
α-methylcyclopentylalanine                            Mcpen
α-methyl-α-naphthylalanine                            Manap
α-methylpenicillamine                                 Mpen
N-(4-aminobutyl)glycine                               Nglu
N-(2-aminoethyl)glycine                               Naeg
N-(3-aminopropyl)glycine                              Norn
N-amino-α-methylbutyrate                              Nmaabu
α-naphthylalanine                                     Anap
N-benzylglycine                                       Nphe
N-(2-carbamylethyl)glycine                            Ngln
N-(carbamylmethyl)glycine                             Nasn
N-(2-carboxyethyl)glycine                             Nglu
N-(carboxymethyl)glycine                              Nasp
N-cyclobutylglycine                                   Ncbut
N-cycloheptylglycine                                  Nchep
N-cyclohexylglycine                                   Nchex
N-cyclodecylglycine                                   Ncdec
N-cyclododecylglycine                                 Ncdod
N-cyclooctylglycine                                   Ncoct
N-cyclopropylglycine                                  Ncpro
N-cycloundecylglycine                                 Ncund
N-(2,2-diphenylethyl)glycine                          Nbhm
N-(3,3-diphenylpropyl)glycine                         Nbhe
D-N-methylglutamine                                   Dnmgln
D-N-methylglutamate                                   Dnmglu
D-N-methylhistidine                                   Dmnhis
D-N-methylisoleucine                                  Dnmile
D-N-methylleucine                                     Dnmleu
D-N-methyllysine                                      Dnmlys
N-methylcyclohexylalanine                             Nmchexa
D-N-methylornithine                                   Dnmorn
N-methylglycine                                       Nala
N-methylaminoisobutyrate                              Nmaib
N-(1-methylpropyl)glycine                             Nile
N-(2-methylpropyl)glycine                             Nleu
D-N-methyltryptophan                                  Dnmtrp
D-N-methyltyrosine                                    Dnmtyr
D-N-methylvaline                                      Dnmval
γ-aminobutyric acid                                   Gabu
L-t-butylglycine                                      Tbug
L-ethylglycine                                        Etg
L-homophenylalanine                                   Hphe
L-α-methylarginine                                    Marg
L-α-methylaspartate                                   Masp
L-α-methylcysteine                                    Mcys
L-α-methylglutamine                                   Mgln
L-α-methylhistidine                                   Mhis
L-α-methylisoleucine                                  Mile
L-α-methylleucine                                     Mleu
N-(3-guanidinopropyl)glycine                          Narg
N-(1-hydroxyethyl)glycine                             Nthr
N-(hydroxyethyl)glycine                               Nser
N-(imidazolylethyl)glycine                            Nhis
N-(3-indolylethyl)glycine                             Nhtrp
N-methyl-γ-aminobutyrate                              Nmgabu
D-N-methylmethionine                                  Dnmmet
N-methylcyclopentylalanine                            Nmcpen
D-N-methylphenylalanine                               Dmnphe
D-N-methylproline                                     Dnmpro
D-N-methylserine                                      Dnmser
D-N-methylthreonine                                   Dnmthr
N-(1-methylethyl)glycine                              Nval
N-methyl-α-naphthylalanine                            Nmanap
N-methylpenicillamine                                 Nmpen
N-(p-hydroxyphenyl)glycine                            Nhtyr
N-(thiomethyl)glycine                                 Ncys
penicillamine                                         Pen
L-α-methylalanine                                     Mala
L-α-methylasparagine                                  Masn
L-α-methyl-t-butylglycine                             Mtbug
L-methylethylglycine                                  Metg
L-α-methylglutamate                                   Mglu
L-α-methylhomophenylalanine                           Mhphe
N-(2-methylthioethyl)glycine                          Nmet
L-α-methyllysine                                      Mlys
L-α-methylmethionine                                  Mmet
L-α-methylnorvaline                                   Mnva
L-α-methylphenylalanine                               Mphe
L-α-methylserine                                      Mser
L-α-methyltryptophan                                  Mtrp
L-α-methylvaline                                      Mval
N-(N-(2,2-diphenylethyl)carbamylmethyl)glycine        Nnbhm
1-carboxy-1-(2,2-diphenyl-ethylamino)cyclopropane     Nmbc
L-α-methylnorleucine                                  Mnle
L-α-methylornithine                                   Morn
L-α-methylproline                                     Mpro
L-α-methylthreonine                                   Mthr
L-α-methyltyrosine                                    Mtyr
L-N-methylhomophenylalanine                           Nmhphe
N-(N-(3,3-diphenylpropyl)carbamylmethyl)glycine       Nnbhe

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

The present invention provides an isolated nucleic acid molecule comprising a sequence
of nucleotides which encodes, or is complementary to a sequence which encodes, a
mammalian α-N-acetylglucosaminidase or fragment or derivative thereof or its like
molecule.

Preferably, the mammal is a human, livestock animal, companion animal, wild animal or
laboratory test animal (e.g. rabbit, rat, mouse or guinea pig). Most preferably, the mammal
is a human. Conveniently, the α-N-acetylglucosaminidase is isolatable from the liver,
kidney or placenta. However, the present invention extends to all mammalian
α-N-acetylglucosaminidase enzymes and from any anatomical or cellular source and/or any
biological fluid source, such as but not limited to plasma, serum, cell extract or lymph
fluid.

Although a preferred embodiment of the present invention contemplates the use of human
α-N-acetylglucosaminidase or genomic or recombinant (e.g. cDNA) genetic sequences
encoding same in the investigation, diagnosis and/or treatment of human subjects (i.e.
homologous system), one skilled in the art will appreciate that the enzyme or genetic
sequences encoding same from a non-human animal may also be useful. Such a
heterologous system is encompassed by the present invention.

The term "nucleic acid molecule" as used herein shall be taken to refer to any RNA or
DNA (e.g. cDNA) molecule, whether single-stranded or double-stranded or in a linear or
covalently-closed form. The nucleic acid molecule may also be DNA corresponding to the
entire genomic gene or a substantial portion thereof or a fragment or derivative thereof.

The nucleic acid molecule of the present invention may constitute solely the nucleotide
sequence encoding α-N-acetylglucosaminidase or an α-N-acetylglucosaminidase-like
molecule or may be part of a larger nucleic acid molecule. Accordingly, the present
invention extends to the isolated genomic α-N-acetylglucosaminidase gene. The
non-translated sequences in a larger nucleic acid molecule may include vector, transcriptional
and/or translational regulatory sequences, promoter, terminator, enhancer, replication or
signal sequences or non-coding regions (e.g. intron sequences) of an isolated genomic gene.

Reference herein to a "gene" is to be taken in its broadest context and includes:

(i) a classical genomic gene consisting of transcriptional and/or translational
regulatory sequences and/or a coding region and/or non-translated sequences (i.e.
introns, 5'- and 3'-untranslated sequences);

(ii) mRNA or cDNA corresponding to the coding regions (i.e. exons)
optionally comprising 5'- or 3'-untranslated sequences of the gene; or

(iii) synthetic, amplified DNA fragments or other recombinant nucleic acid
molecules produced in vitro and comprising all or a part of the coding region and/or
5'- or 3'-untranslated sequences of the gene.

The term "gene" is also used to describe synthetic or fusion molecules encoding all or
part of a functional product. A functional product is one which comprises a sequence of
nucleotides or is complementary to a sequence of nucleotides which encodes a functional
polypeptide, in particular a polypeptide having the catalytic activity of
α-N-acetylglucosaminidase or a homologue, analogue or derivative thereof.

For the present purpose, "homologues" of a nucleotide sequence shall be taken to refer
to an isolated nucleic acid molecule which is substantially the same as the nucleic acid
molecule of the present invention or its complementary nucleotide sequence,
notwithstanding the occurrence within said sequence, of one or more nucleotide
substitutions, insertions, deletions, or rearrangements.

"Analogues" of a nucleotide sequence set forth herein shall be taken to refer to an
isolated nucleic acid molecule which is substantially the same as a nucleic acid molecule
of the present invention or its complementary nucleotide sequence, notwithstanding the
occurrence of any non-nucleotide constituents not normally present in said isolated
nucleic acid molecule, for example carbohydrates, radiochemicals including
radionucleotides, reporter molecules such as, but not limited to DIG, alkaline
phosphatase or horseradish peroxidase, amongst others.

"Derivatives" of a nucleotide sequence set forth herein shall be taken to refer to any
isolated nucleic acid molecule which contains significant sequence similarity to said
sequence or a part thereof. Generally, the nucleotide sequence of the present invention
may be subjected to mutagenesis to produce single or multiple nucleotide substitutions,
deletions and/or insertions. Nucleotide insertional derivatives of the nucleotide sequence
of the present invention include 5' and 3' terminal fusions as well as intra-sequence
insertions of single or multiple nucleotides or nucleotide analogues. Insertional
nucleotide sequence variants are those in which one or more nucleotides or nucleotide
analogues are introduced into a predetermined site in the nucleotide sequence of said
sequence, although random insertion is also possible with suitable screening of the
resulting product being performed. Deletional variants are characterised by the removal
of one or more nucleotides from the nucleotide sequence. Substitutional nucleotide
variants are those in which at least one nucleotide in the sequence has been removed and
a different nucleotide or nucleotide analogue inserted in its place.
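The three classes of derivative described above can be sketched as simple string operations. This is a minimal illustration, not part of the specification: the function names and the short parent sequence are hypothetical, and positions are 0-based.

```python
def insertional_variant(seq: str, position: int, inserted: str) -> str:
    """Insert nucleotides before 0-based `position`.

    position == 0 gives a 5' terminal fusion and position == len(seq)
    gives a 3' terminal fusion; anything between is intra-sequence.
    """
    return seq[:position] + inserted + seq[position:]


def deletional_variant(seq: str, start: int, length: int) -> str:
    """Remove `length` nucleotides beginning at 0-based `start`."""
    return seq[:start] + seq[start + length:]


def substitutional_variant(seq: str, position: int, new_base: str) -> str:
    """Remove the nucleotide at `position` and put `new_base` in its place."""
    return seq[:position] + new_base + seq[position + 1:]


parent = "ATGGAGGCG"  # hypothetical sequence, for illustration only
print(insertional_variant(parent, 3, "CCC"))
print(deletional_variant(parent, 3, 3))
print(substitutional_variant(parent, 4, "C"))
```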

Preferably, a homologue, analogue or derivative of an α-N-acetylglucosaminidase gene
according to any embodiments described herein, comprises a sequence of nucleotides of
at least 10 contiguous nucleotides derived from SEQ ID NO:1 or SEQ ID NO:3 or a
complementary strand thereof, wherein the sequence of said homologue, analogue or
derivative is at least 40% identical to SEQ ID NO:1 or SEQ ID NO:3 or a
complementary strand thereof or wherein said homologue, analogue or derivative is
capable of hybridising to said sequence under at least low stringency hybridisation
conditions.
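The requirement for at least 10 contiguous nucleotides can be checked computationally. The following is a sketch under stated assumptions: it uses Python's standard `difflib.SequenceMatcher`, the function names are hypothetical, and it compares one strand only, so a complementary strand would first need to be reverse-complemented.

```python
from difflib import SequenceMatcher


def longest_shared_run(candidate: str, reference: str) -> int:
    """Length of the longest stretch of identical contiguous nucleotides."""
    a, b = candidate.upper(), reference.upper()
    # autojunk=False stops difflib from ignoring frequently occurring bases.
    matcher = SequenceMatcher(None, a, b, autojunk=False)
    return matcher.find_longest_match(0, len(a), 0, len(b)).size


def meets_contiguity_requirement(candidate: str, reference: str,
                                 minimum: int = 10) -> bool:
    """True if the sequences share at least `minimum` contiguous nucleotides."""
    return longest_shared_run(candidate, reference) >= minimum
```

The default `minimum` of 10 follows the figure recited above; the test sequences used to exercise the function are arbitrary and are not taken from SEQ ID NO:1 or SEQ ID NO:3.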

For the purposes of nomenclature, the nucleotide sequence set forth in SEQ ID NO:1
relates to the cDNA encoding the human α-N-acetylglucosaminidase enzyme.

The nucleotide sequence set forth in SEQ ID NO:3 relates to the genomic gene equivalent
of the cDNA encoding the human liver α-N-acetylglucosaminidase enzyme. Those
skilled in the art will be aware that the specific exon sequences described in SEQ ID
NO:3 correspond to the coding regions of the α-N-acetylglucosaminidase gene, said exon
regions further comprising the entire open reading frame of the cDNA sequence set forth
in SEQ ID NO:1, when aligned in a head-to-tail configuration. The intron sequences of
SEQ ID NO:3, which correspond to non-coding regions of the gene which are spliced
from the primary transcription product thereof, although not explicitly defined, may be
readily deduced by those skilled in the art, when provided with the exon sequence data
provided in the nucleotide sequence listing.
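The relationship between exon coordinates, the head-to-tail open reading frame and the deduced introns can be illustrated as follows. This is a toy sketch only: the genomic string and exon coordinates are invented for the example and bear no relation to SEQ ID NO:3, and the function names are hypothetical. Coordinates are 0-based and end-exclusive.

```python
def splice_exons(genomic: str, exons: list[tuple[int, int]]) -> str:
    """Join the exon intervals head-to-tail in genomic order."""
    return "".join(genomic[start:end] for start, end in sorted(exons))


def introns_from_exons(exons: list[tuple[int, int]]) -> list[tuple[int, int]]:
    """Deduce intron intervals as the gaps between consecutive exons."""
    ordered = sorted(exons)
    return [(prev_end, next_start)
            for (_, prev_end), (next_start, _) in zip(ordered, ordered[1:])]


# Invented example: two exons separated by one 12-nucleotide intron.
genomic = "ATGGCC" + "GTAAGTTTTCAG" + "AAATGA"
exons = [(0, 6), (18, 24)]

print(splice_exons(genomic, exons))   # exons joined head-to-tail
print(introns_from_exons(exons))      # deduced intron coordinates
```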

The nucleotide sequence of the present invention may correspond to the sequence of the
naturally-occurring α-N-acetylglucosaminidase gene or may comprise a homologue,
analogue or derivative thereof which contains single or multiple nucleotide substitutions,
deletions and/or additions. All such homologues, analogues or derivatives encode
α-N-acetylglucosaminidase or α-N-acetylglucosaminidase-like molecules or a homologue,
analogue or derivative thereof as contemplated by the present invention. The length of the
nucleotide sequence may vary from a few bases, such as in nucleic acid probes or primers,
to a full length sequence.

The present invention is particularly directed to the nucleic acid in cDNA form and
particularly when inserted into an expression vector. The expression vector may be
replicable in a eukaryotic or prokaryotic cell and may either produce mRNA or the mRNA
may be subsequently translated into α-N-acetylglucosaminidase or like molecule.
Particularly preferred eukaryotic cells include CHO cells but may be in any other suitable
mammalian cells or cell lines or non-mammalian cells such as yeast or insect cells.

In an alternative embodiment, the present invention provides a nucleic acid molecule
comprising a sequence of nucleotides which encodes or is complementary to a sequence
which encodes a polypeptide capable of hydrolysing the α-N-acetylglucosamine residues
from the non-reducing terminus of heparan sulphate and heparin fragments and wherein
said nucleotide sequence is capable of hybridising under at least low stringency conditions
to a nucleotide sequence set forth in SEQ ID NO:1 or SEQ ID NO:3 or a homologue,
analogue or derivative thereof.

A second aspect of the invention provides an isolated nucleic acid molecule comprising a
sequence of nucleotides which is capable of hybridising under at least low stringency
conditions to a nucleotide sequence set forth in SEQ ID NO:1 or SEQ ID NO:3 or a
complementary strand or a homologue, analogue or derivative thereof.

Preferably, hybridisation is possible under at least medium stringency conditions. More
preferably, hybridisation is possible under high stringency conditions.

For the purposes of defining the level of stringency, reference can conveniently be made
to Sambrook et al (1989) or Ausubel et al (1987) which are herein incorporated by
reference.

A low stringency is defined herein as being a hybridisation and/or wash carried out in 4-6X
SSC/0.1-0.5% w/v SDS at 37-45°C for 2-3 hours. A medium stringency hybridisation
and/or wash is carried out in 1-4X SSC/0.25-0.5% w/v SDS at ≥ 45°C for 2-3 hours and
a high stringency hybridisation and/or wash is carried out in 0.1-1X SSC/0.1% w/v SDS at
60°C for 1-3 hours.
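The three stringency levels recited above can be expressed as a small classifier. The sketch below is illustrative only and makes several assumptions that are not in the specification: the class and function names are hypothetical, the medium and high stringency temperatures are treated as minimum values, duration is ignored, and because the recited SSC ranges share boundary values (1X and 4X) the higher level is reported where two levels are satisfied.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Wash:
    ssc_x: float        # SSC concentration, as a multiple of 1X
    sds_percent: float  # SDS concentration, % w/v
    temp_c: float       # temperature, degrees Celsius


def stringency(wash: Wash) -> str:
    """Classify a hybridisation/wash against the recited ranges.

    Levels are tested from high to low so that boundary values are
    assigned to the more stringent level (an assumption of this sketch).
    """
    if (0.1 <= wash.ssc_x <= 1
            and math.isclose(wash.sds_percent, 0.1)
            and wash.temp_c >= 60):
        return "high"
    if (1 <= wash.ssc_x <= 4
            and 0.25 <= wash.sds_percent <= 0.5
            and wash.temp_c >= 45):
        return "medium"
    if (4 <= wash.ssc_x <= 6
            and 0.1 <= wash.sds_percent <= 0.5
            and 37 <= wash.temp_c <= 45):
        return "low"
    return "outside the recited ranges"


print(stringency(Wash(ssc_x=6, sds_percent=0.1, temp_c=42)))
print(stringency(Wash(ssc_x=2, sds_percent=0.5, temp_c=50)))
print(stringency(Wash(ssc_x=0.1, sds_percent=0.1, temp_c=65)))
```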

Alternative conditions of stringency may be employed to those specifically recited herein.
Generally, the stringency is increased by reducing the concentration of SSC buffer,
and/or increasing the concentration of SDS and/or increasing the temperature of the
hybridisation and/or wash. Those skilled in the art will be aware that the conditions for
hybridisation and/or wash may vary depending upon the nature of the hybridisation
membrane or the type of hybridisation probe used. Conditions for hybridisations and
washes are well understood by one normally skilled in the art. For the purposes of
clarification of parameters affecting hybridisation between nucleic acid molecules,
reference is found in pages 2.10.8 to 2.10.16 of Ausubel et al (1987), which is herein
incorporated by reference.
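The effect of SSC concentration and temperature on stringency can be illustrated with a commonly quoted empirical approximation for the melting temperature of long DNA duplexes. This formula is not recited in the specification and is offered only as a sketch: the constant in the length term differs between references, the approximation is usually quoted for monovalent cation concentrations below roughly 0.4 M (so values for concentrated SSC indicate a trend only), and the function name and example inputs are hypothetical. It does not model the effect of SDS.

```python
import math

# 1X SSC is 0.15 M NaCl plus 0.015 M trisodium citrate (three Na+ each),
# giving approximately 0.195 M sodium ions.
NA_MOLAR_PER_1X_SSC = 0.15 + 3 * 0.015


def estimated_tm_c(ssc_x: float, gc_percent: float, length_nt: int) -> float:
    """Approximate duplex melting temperature in degrees Celsius.

    Tm = 81.5 + 16.6*log10[Na+] + 0.41*(%GC) - 500/length
    (an empirical approximation; see the caveats in the text above).
    """
    sodium_molar = ssc_x * NA_MOLAR_PER_1X_SSC
    return (81.5 + 16.6 * math.log10(sodium_molar)
            + 0.41 * gc_percent - 500.0 / length_nt)


# Lowering the SSC concentration lowers the estimated Tm, so a wash at a
# fixed temperature sits closer to the Tm and is therefore more stringent.
for ssc in (6, 1, 0.1):
    print(ssc, round(estimated_tm_c(ssc, gc_percent=50, length_nt=500), 1))
```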
Those skilled in the art will be aware that the nucleotide sequences set forth in SEQ ID
NO:1 and SEQ ID NO:3 may be used to isolate the corresponding genes from other
human tissues or alternatively, from the tissues or cells of other species, without undue
experimentation. Means for the isolation of such related sequences will also be known
to those skilled in the art, for example nucleic acid hybridisation, polymerase chain
reaction, antibody screening of expression libraries, functional screening of expression
libraries, or complementation of mutants, amongst others. The present invention is not
to be limited by the source from which the specific gene sequences described herein have
been isolated or by the means used to isolate said sequences.

In one embodiment, a related genetic sequence comprising genomic DNA, or mRNA,
or cDNA is contacted with a hybridisation effective amount of a genetic sequence which
encodes α-N-acetylglucosaminidase, or its complementary nucleotide sequence or a
homologue, analogue, derivative or functional part thereof, and then said hybridisation
is detected using a suitable detection means.

The related genetic sequence may be in a recombinant form, in a virus particle,
bacteriophage particle, yeast cell, animal cell, or a plant cell. Preferably, the related
genetic sequence originates from an animal species or a human. More preferably, the
related genetic sequence originates from a human.

Preferably, the genetic sequence which encodes α-N-acetylglucosaminidase (i.e. probe
or latter genetic sequence) comprises a sequence of nucleotides of at least 10 nucleotides,
more preferably at least 20 nucleotides, even more preferably at least 50 nucleotides and
even still more preferably at least 100 nucleotides derived from the nucleotide sequence
set forth in SEQ ID NO:1 or SEQ ID NO:3 or a complementary sequence or a
homologue, analogue or derivative thereof.

Preferably, the detection means is a reporter molecule capable of giving an identifiable
signal (e.g. a radioisotope such as ³²P or ³⁵S or a biotinylated molecule) covalently
attached to the α-N-acetylglucosaminidase probe.
In an alternative embodiment, the detection means is a polymerase chain reaction.
According to this embodiment, two opposing non-complementary nucleic acid "primer
molecules" of at least 10 nucleotides in length, more preferably at least 20 nucleotides
in length, derived from the nucleotide sequence set forth in SEQ ID NO:1 or SEQ ID
NO:3 may be contacted with a nucleic acid "template molecule" and specific nucleic acid
molecule copies of the template molecule amplified in a polymerase chain reaction.

The opposing primer molecules are selected such that they are each capable of
hybridising to complementary strands of the same template molecule, wherein DNA
polymerase-dependent DNA synthesis occurring from a first opposing primer molecule
will be in a direction toward the second opposing primer molecule.

Accordingly, both primers hybridise to said template molecule such that, in the presence
of a DNA polymerase enzyme, a cofactor and appropriate substrate, DNA synthesis
occurs in the 5' to 3' direction from each primer molecule towards the position on the
DNA where the other primer molecule is hybridised, thereby amplifying the intervening
DNA.
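The geometry of opposing primers can be sketched in code. The example below is a simplified illustration under stated assumptions: it requires exact matches between primer and template (real primers tolerate mismatches), the template and primer sequences are invented and are not derived from SEQ ID NO:1 or SEQ ID NO:3, and the function names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (5' to 3')."""
    return seq.translate(COMPLEMENT)[::-1]


def amplicon(template_sense: str, forward: str, reverse: str):
    """Return the sense-strand region bounded by the two primers, or None.

    `forward` is taken from the sense strand and `reverse` from the
    antisense strand, both written 5' to 3'. Extension from each primer
    therefore runs toward the site where the other primer anneals.
    """
    start = template_sense.find(forward)
    if start == -1:
        return None
    # The reverse primer appears on the sense strand as its reverse complement.
    reverse_site = reverse_complement(reverse)
    end = template_sense.find(reverse_site, start + len(forward))
    if end == -1:
        return None
    return template_sense[start:end + len(reverse_site)]


# Invented template: flank + forward site + intervening DNA + reverse site + flank.
template = "TTGACA" + "ATGGAGGCGG" + "CCCAAATTTGGG" + "TGGCCGCGTA" + "AGTC"
forward_primer = "ATGGAGGCGG"
reverse_primer = reverse_complement("TGGCCGCGTA")

print(amplicon(template, forward_primer, reverse_primer))
```

Both primers in the example are 10 nucleotides long, the minimum length recited above.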

Those skilled in the art are aware of the technical requirements of the polymerase chain
reaction and are capable of making any modifications to the reaction
conditions. For example, the polymerase chain reaction may be used in any suitable
format, such as amplified fragment length polymorphism (AFLP), single-strand conformation
polymorphism (SSCP), amplification and mismatch detection (AMD), interspersed
repetitive sequence polymerase chain reaction (IRS-PCR), inverse polymerase chain
reaction (iPCR) and reverse transcription polymerase chain reaction (RT-PCR), amongst
others, to isolate a related α-N-acetylglucosaminidase gene sequence or identify a
mutation in an α-N-acetylglucosaminidase genetic sequence. Such variations of the
polymerase chain reaction are discussed in detail by McPherson et al (1991), which is
incorporated herein by reference. The present invention encompasses all such variations,
the only requirement being that the final product of the reaction is an isolated nucleic acid
molecule which is capable of encoding α-N-acetylglucosaminidase or a homologue,
analogue or derivative thereof.

In a preferred embodiment, the first primer molecule is preferably derived from the sense
strand of a gene which encodes α-N-acetylglucosaminidase, in particular from the
nucleotide sequence set forth in SEQ ID NO:1 or SEQ ID NO:3 or a homologue,
derivative or analogue thereof and the second primer molecule is preferably derived from
the antisense strand of said gene.

Those skilled in the art will be aware that it is not essential to the performance of the 
invention that the primer molecules be derived from the same gene. 

According to this embodiment, the nucleic acid primer molecule may further consist of
a combination of any of the nucleotides adenine, cytidine, guanine, thymidine, or
inosine, or functional analogues or derivatives thereof, capable of being incorporated into
a polynucleotide molecule provided that it is capable of hybridising under at least low
stringency conditions to the nucleic acid molecule set forth in SEQ ID NO:1 or SEQ ID
NO:3 or a homologue, analogue or derivative thereof.

The nucleic acid primer molecules may further be each contained in an aqueous pool
comprising other nucleic acid primer molecules. More preferably, the nucleic acid 
primer molecule is in a substantially pure form. 

The nucleic acid template molecule may be in a recombinant form, in a virus particle, 
bacteriophage particle, yeast cell, animal cell, or a plant cell. Preferably, the related 
genetic sequence originates from a cell, tissue, or organ derived from an animal species 
or a human. More preferably, the related genetic sequence originates from a cell, tissue, 
or organ derived from a human. 

Accordingly, a third aspect of the present invention extends to an isolated nucleic acid
molecule which is at least 40% identical to the nucleotide sequence set forth in SEQ ID
NO:1 or SEQ ID NO:3 or to a complementary strand thereof or a homologue, analogue
or derivative thereof.

Preferably, the percentage identity to SEQ ID NO:1 or SEQ ID NO:3 is at least about
55%, still more preferably at least about 65%, yet still more preferably at least about
75-80% and even still more preferably at least about 85-95%.
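Percentage identity between two nucleotide sequences can be computed once they have been aligned. The specification does not state how identity is to be calculated, so the following is a sketch under explicit assumptions: the inputs are already aligned to equal length with "-" marking gaps, identity is the number of matching columns divided by the number of aligned columns (gapped columns count as mismatches), and the tier labels use the lower bound of each preferred range. All names are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over the columns of a pairwise alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Drop columns that are gaps in both sequences; they carry no information.
    columns = [(x, y) for x, y in zip(aligned_a.upper(), aligned_b.upper())
               if not (x == "-" and y == "-")]
    if not columns:
        raise ValueError("alignment has no informative columns")
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


def identity_tier(percent: float) -> str:
    """Map a percent identity onto the preferred thresholds recited above."""
    for threshold in (85, 75, 65, 55, 40):
        if percent >= threshold:
            return f"at least {threshold}%"
    return "below 40%"


print(identity_tier(percent_identity("ACGTACGTAC", "ACGTTCGTAC")))
```

Other conventions, such as dividing by the length of the shorter sequence, give different values for the same alignment, which is why the denominator is stated explicitly here.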



In an even more preferred embodiment, the present invention provides an isolated
nucleic acid molecule which is at least 40% identical to the nucleotide sequence set forth
in SEQ ID NO:1 or SEQ ID NO:3 or to a complementary strand thereof or a homologue,
analogue or derivative thereof and is capable of hybridising under at least low stringency
conditions to a nucleotide sequence set forth in SEQ ID NO:1 or SEQ ID NO:3.



In a particularly preferred embodiment, the isolated nucleic acid molecule described
herein is further capable of encoding a sequence of amino acids which is capable of
carrying out the enzyme reaction catalysed by an α-N-acetylglucosaminidase enzyme.

The isolated nucleic acid molecule of the present invention is also useful for developing
a genetic construct comprising a sense molecule, for the expression or over-expression
of α-N-acetylglucosaminidase in prokaryotic or eukaryotic cells. Particularly preferred
eukaryotic cells include CHO cells but may be in any other suitable mammalian cells or
cell lines or non-mammalian cells such as yeast or insect cells.

The term "sense molecule" as used herein shall be taken to refer to an isolated nucleic 
acid molecule of the invention as described herein, which is provided in a format suitable 
for its expression to produce a recombinant polypeptide, when said sense molecule is 
introduced into a host cell. 

In a particularly preferred embodiment, a sense molecule which encodes the 
a-N-acetylglucosaminidase comprises a sequence of nucleotides set forth in SEQ ID NO:1 or 
SEQ ID NO:3 or a complementary strand, homologue, analogue or derivative thereof. 

In a most particularly preferred embodiment, the sense molecule of the invention 
comprises the sequence of nucleotides set forth in SEQ ID NO:1 or a complementary 
strand, homologue, analogue or derivative thereof. 

Those skilled in the art will be aware that expression of a sense molecule may require the 
nucleic acid molecule of the invention to be placed in operable connection with a 
promoter sequence to produce a "sense construct". The choice of promoter for the 
present purpose may vary depending upon the level of expression of the sense molecule 
required and/or the tissue-specificity or developmental-specificity of expression of the 
sense molecule which is required. The sense construct may further comprise a terminator 
sequence and be introduced into a suitable host cell where it is capable of being expressed 
to produce a recombinant polypeptide gene product. 

In the context of the present invention, a sense molecule which corresponds to a genetic 
sequence or isolated nucleic acid molecule which encodes a-N-acetylglucosaminidase 
polypeptide or a homologue, analogue or derivative thereof, placed operably under the 
control of a suitable promoter sequence, is introduced into a cell using any suitable 
method for the transformation of said cell and said genetic sequence or isolated nucleic 
acid molecule is expressed therein to produce said polypeptide. 

The present invention clearly extends to genetic constructs designed to facilitate 
expression of any nucleic acid molecule described herein. 

A genetic construct of the present invention comprises the foregoing sense molecule, 
placed operably under the control of a promoter sequence capable of regulating the 
expression of said nucleic acid molecule in a prokaryotic or eukaryotic cell, 
preferably a mammalian cell such as a CHO cell, a yeast cell, an insect cell or a bacterial 
cell. Said genetic construct optionally comprises, in addition to a promoter and sense 
molecule, a terminator sequence. 

The term "terminator" refers to a DNA sequence at the end of a transcriptional unit 
which signals termination of transcription. Terminators are 3'-non-translated DNA 
sequences containing a polyadenylation signal, which facilitates the addition of 
polyadenylate sequences to the 3'-end of a primary transcript. Terminators active in 
plant cells are known and described in the literature. They may be isolated from 
bacteria, fungi, viruses, animals and/or plants. 

Reference herein to a "promoter" is to be taken in its broadest context and includes the 
transcriptional regulatory sequences of a classical genomic gene, including the TATA box 
which is required for accurate transcription initiation, with or without a CCAAT box 
sequence and additional regulatory elements (i.e. upstream activating sequences, 
enhancers and silencers) which alter gene expression in response to developmental and/or 
external stimuli, or in a tissue-specific manner. A promoter is usually, but not 
necessarily, positioned upstream, or 5', of a structural gene, the expression of which it 
regulates. Furthermore, the regulatory elements comprising a promoter are usually 
positioned within 2 kb of the start site of transcription of the gene. 

In the present context, the term "promoter" is also used to describe a synthetic or fusion 
molecule, or derivative which confers, activates or enhances expression of said sense 
molecule in a cell. 


Preferred promoters may contain additional copies of one or more specific regulatory 
elements, to further enhance expression of the sense molecule and/or to alter the spatial 
expression and/or temporal expression of said sense molecule. For example, regulatory 
elements which confer copper inducibility may be placed adjacent to a heterologous 
promoter sequence driving expression of a sense molecule, thereby conferring copper 
inducibility on the expression of said molecule. 

Placing a sense molecule under the regulatory control of a promoter sequence means 
positioning said molecule such that expression is controlled by the promoter 
sequence. Promoters are generally positioned 5' (upstream) to the genes that they 
control. In the construction of heterologous promoter/structural gene combinations it is 
generally preferred to position the promoter at a distance from the gene transcription start 
site that is approximately the same as the distance between that promoter and the gene 
it controls in its natural setting, i.e., the gene from which the promoter is derived. As 
is known in the art, some variation in this distance can be accommodated without loss of 
promoter function. Similarly, the preferred positioning of a regulatory sequence element 
with respect to a heterologous gene to be placed under its control is defined by the 
positioning of the element in its natural setting, i.e., the genes from which it is derived. 
Again, as is known in the art, some variation in this distance can also occur. 

Examples of promoters suitable for use in genetic constructs of the present invention 
include viral, fungal, bacterial, animal and plant derived promoters capable of 
functioning in animal, human, yeast, insect or bacterial cells. The promoter may 
regulate the expression of said molecule constitutively, or differentially with respect 
to the tissue in which expression occurs, or with respect to the developmental stage at 
which expression occurs, or in response to external stimuli such as physiological stresses, 
or plant pathogens, or metal ions, amongst others. Preferably, the promoter is capable 
of regulating expression of a sense molecule in a cell derived from an animal species or 
human. 

In a particularly preferred embodiment, the promoter is derived from the genomic gene 
encoding a-N-acetylglucosaminidase, preferably the human a-N-acetylglucosaminidase 
gene. In a more preferred embodiment, however, the promoter is derived from the 
nucleotide sequence set forth in SEQ ID NO:3 or is at least capable of hybridising to 
nucleotide residues 1 to 989 of SEQ ID NO:3 or at least 20 contiguous nucleotides 
derived therefrom. 

In an even more particularly preferred embodiment, the promoter is the CMV promoter 
sequence or a promoter sequence derived therefrom. 

An alternative embodiment of the invention is directed to a genetic construct comprising 
a promoter or functional derivative, part, fragment, homologue, or analogue thereof, 
derived from the a-N-acetylglucosaminidase genomic gene defined by SEQ ID NO:3. 

Preferably, said genetic construct further comprises the a-N-acetylglucosaminidase 
sequence defined by SEQ ID NO:1 placed in operable connection with said promoter. 

A further aspect of the present invention is directed to synthetic a-N-acetylglucosaminidase 
or like molecule. 

The term "synthetic" as used herein shall be taken to include both recombinant and 
chemically-synthesised molecules produced by the sequential addition of amino acid 
residues or groups of amino acid residues in defined order. 

In one embodiment, the invention relates to recombinant a-N-acetylglucosaminidase or 
like molecule encoded by or expressed from the nucleic acid molecules as hereinbefore 
described. 

In another embodiment, the synthetic a-N-acetylglucosaminidase or like molecule 
comprises a sequence of amino acids which is at least 40% identical to the amino acid 
sequence set forth in any one of SEQ ID NOs:2, 4, 5 or 6. 

More preferably, the percentage identity is at least 60% and still more preferably at least 
80% or 85-90%. 

A particularly preferred embodiment of the present invention provides a synthetic 
a-N-acetylglucosaminidase as hereinbefore defined which comprises a sequence of amino acids 
substantially as set forth in any one of SEQ ID NOs:2, 4, 5 or 6 or a homologue, analogue 
or derivative thereof. 

For the purposes of nomenclature, the amino acid sequence set forth in SEQ ID NO:2 
comprises the full-length translation product of the human a-N-acetylglucosaminidase gene 
(hereinafter referred to as the "a-N-acetylglucosaminidase polypeptide" or "SEQ ID 
NO:2") produced by expression of either the cDNA sequence defined by SEQ ID NO:1 
or the genomic gene defined by SEQ ID NO:3. The a-N-acetylglucosaminidase 
polypeptide comprises at least seven potentially-glycosylated Asn residues, at positions 
261, 272, 435, 503, 513, 526 and 532. Furthermore, the amino acid sequence of the 
a-N-acetylglucosaminidase polypeptide may comprise a signal peptide of approximately 23 
amino acid residues in length, with a probable site for signal peptide peptidase cleavage 
occurring between Gly23 and Asp24. 
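
The specification does not state how the potentially-glycosylated Asn residues were 
identified. One widely used convention, assumed here purely for illustration, is to scan 
for the N-linked glycosylation sequon Asn-X-Ser/Thr, where X is any residue other than 
Pro. The sketch below reports 1-based Asn positions on that basis and also shows how a 
signal peptide of a given length would be removed to give the mature chain. The function 
names and the toy sequence are hypothetical; the toy sequence is not SEQ ID NO:2. 

```python
import re

# Assumed rule: Asn, then any residue except Pro, then Ser or Thr.
# The lookahead allows overlapping sequons to be reported.
SEQUON = re.compile(r"(?=N[^P][ST])")


def sequon_positions(protein: str) -> list[int]:
    """Return 1-based positions of Asn residues that begin an N-X-S/T sequon."""
    return [m.start() + 1 for m in SEQUON.finditer(protein.upper())]


def mature_chain(protein: str, signal_length: int = 23) -> str:
    """Drop the first `signal_length` residues (cleavage after that residue)."""
    if signal_length < 0 or signal_length > len(protein):
        raise ValueError("signal_length out of range")
    return protein[signal_length:]


if __name__ == "__main__":
    toy = "MKNATLLNPSANGSQ"  # hypothetical sequence for demonstration
    print(sequon_positions(toy))            # Asn positions in sequons
    print(mature_chain(toy, signal_length=2))
```

With the default of 23 residues, the mature chain would begin at residue 24, matching the 
probable cleavage site between Gly23 and Asp24 described above. Positions reported by 
`sequon_positions` are numbered on whatever sequence is passed in, so they refer to the 
full-length numbering only if the full-length sequence is supplied. 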

The amino acid sequences set forth in SEQ ID NOs:4-6 relate to N-terminal and internal 
(i.e. CNBr) amino acid sequences derived from human a-N-acetylglucosaminidase, 
purified as described in Example 1. As described in Example 2, the purified form of the 
enzyme comprises two polypeptides having approximate molecular weights of 82 and 77 
kDa. The sequence set forth in SEQ ID NO:4 relates to the N-terminal sequence of the 82 
kDa polypeptide, while SEQ ID NO:5 relates to the N-terminal sequence of the 77 kDa 
polypeptide. Furthermore, SEQ ID NO:4 comprises amino acid residues 24-43 of SEQ 
ID NO:2, while SEQ ID NO:5 comprises amino acid residues 59-76 of SEQ ID NO:2. 

The amino acid sequence defined by SEQ ID NO:6 relates to the CNBr-cleaved peptide 
of purified human a-N-acetylglucosaminidase. This amino acid sequence aligns with 
amino acid residues 540-554 of the a-N-acetylglucosaminidase polypeptide (SEQ ID 
NO:2). 
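
The residue ranges given above use 1-based, inclusive numbering on SEQ ID NO:2. As a 
small worked example, the sketch below computes the length implied by each range and 
shows how such a range maps onto a zero-based string slice. The helper names are 
hypothetical and the demonstration string is not a real sequence. 

```python
# Residue ranges on SEQ ID NO:2 as stated in the text (1-based, inclusive).
FRAGMENTS = {
    "SEQ ID NO:4": (24, 43),
    "SEQ ID NO:5": (59, 76),
    "SEQ ID NO:6": (540, 554),
}


def fragment_length(first: int, last: int) -> int:
    """Number of residues in an inclusive 1-based range."""
    if first < 1 or last < first:
        raise ValueError("invalid residue range")
    return last - first + 1


def extract(sequence: str, first: int, last: int) -> str:
    """Return residues first..last (1-based, inclusive) of a sequence."""
    if first < 1 or last < first or last > len(sequence):
        raise ValueError("residue range outside sequence")
    return sequence[first - 1:last]


LENGTHS = {name: fragment_length(*span) for name, span in FRAGMENTS.items()}

if __name__ == "__main__":
    print(LENGTHS)
    print(extract("ABCDEFGH", 3, 5))  # placeholder string, not a protein
```

On this numbering the three fragments span 20, 18 and 15 residues respectively, and the 
first residue of SEQ ID NO:4 (residue 24) coincides with the first residue after the 
probable signal peptide cleavage site. 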

In the present context, "homologues" of a polypeptide refer to those polypeptides, 
enzymes or proteins which have a similar a-N-acetylglucosaminidase enzyme activity, 
notwithstanding any amino acid substitutions, additions or deletions thereto. A 
homologue may be isolated or derived from the same or another animal species. 

Furthermore, the amino acids of a homologous polypeptide may be replaced by other 
amino acids having similar properties, for example hydrophobicity, hydrophilicity, 
hydrophobic moment, charge or antigenicity, and so on. 

"Analogues" encompass a-N-acetylglucosaminidase polypeptides and peptide derivatives 
thereof notwithstanding the occurrence of any non-naturally occurring amino acid 
analogues therein. 

The term "derivative" in relation to the polypeptides of the invention refers to mutants, 
parts or fragments of a functional molecule. Derivatives include modified peptides in which 
ligands are attached to one or more of the amino acid residues contained therein, such 
as carbohydrates, enzymes, proteins, polypeptides or reporter molecules such as 
radionuclides or fluorescent compounds. Glycosylated, fluorescent, acylated or alkylated 
forms of the subject peptides are particularly contemplated by the present invention. 
Additionally, derivatives of a polypeptide may comprise fragments or parts of an amino 
acid sequence disclosed herein and are within the scope of the invention, as are 
homopolymers or heteropolymers comprising two or more copies of the subject 
polypeptides. Procedures for derivatizing peptides are well-known in the art. 

Accordingly, this aspect of the present invention is directed to any proteinaceous molecule 
comprising an amino acid sequence corresponding to the full length mammalian 
a-N-acetylglucosaminidase enzyme or to a like molecule. The like molecule, therefore, 
comprises parts, derivatives and/or portions of the a-N-acetylglucosaminidase enzyme 
whether functional or not. 

Preferably, the mammal is human but may be of non-human origin as contemplated above. 

The synthetic or recombinant a-N-acetylglucosaminidase of the present invention may 
comprise an amino acid sequence corresponding to the naturally occurring amino acid 
sequence or may contain single or multiple amino acid substitutions, deletions and/or 
additions. The length of the amino acid sequence may range from a few residues to a full 
length molecule. 

Amino acid substitutions are typically of single residues. Amino acid insertions will 
usually be in the order of about 1-10 amino acid residues and deletions will range from 
about 1-20 residues. Preferably, deletions or insertions are made in adjacent pairs, i.e. a 
deletion of two residues or insertion of two residues. 



Amino acid insertional derivatives of a-N-acetylglucosaminidase of the present invention 
include amino and/or carboxyl terminal fusions as well as intra-sequence insertions of 
single or multiple amino acids. Insertional amino acid sequence variants are those in which 
one or more amino acid residues are introduced into a predetermined site in the protein 
although random insertion is also possible with suitable screening of the resulting product. 
Deletional variants are characterised by the removal of one or more amino acids from the 
sequence. Substitutional amino acid variants are those in which at least one residue in the 
sequence has been removed and a different residue inserted in its place. Typical 
substitutions are those made in accordance with the following Table 2: 

The amino acid variants referred to above may readily be made using peptide synthetic 
techniques well known in the art, such as solid phase peptide synthesis (Merrifield 
synthesis) and the like, or by recombinant DNA manipulations. Techniques for making 
substitution mutations at predetermined sites in DNA having known or partially known 
sequence are well known and include, for example, M13 mutagenesis. The manipulation 
of DNA sequence to produce variant proteins which manifest as substitutional, insertional 
or deletional variants is conveniently described elsewhere, such as in Sambrook et al, 1989, 
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratories, Cold Spring 
Harbor, NY. 
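
The three classes of variant described above (substitutional, insertional and deletional) 
can be illustrated at the sequence level with simple string operations. The following is 
a minimal sketch only: it models the resulting amino acid sequences, not the mutagenesis 
or synthesis procedures used to obtain them. Function names and the toy sequence are 
hypothetical, and positions are 1-based. 

```python
def substitute(seq: str, position: int, new_residue: str) -> str:
    """Replace the single residue at `position` with `new_residue`."""
    if not 1 <= position <= len(seq) or len(new_residue) != 1:
        raise ValueError("invalid substitution")
    i = position - 1
    return seq[:i] + new_residue + seq[i + 1:]


def insert(seq: str, after_position: int, residues: str) -> str:
    """Insert `residues` after `after_position` (0 gives an N-terminal fusion)."""
    if not 0 <= after_position <= len(seq):
        raise ValueError("invalid insertion site")
    return seq[:after_position] + residues + seq[after_position:]


def delete(seq: str, first: int, last: int) -> str:
    """Remove residues first..last inclusive."""
    if not 1 <= first <= last <= len(seq):
        raise ValueError("invalid deletion range")
    return seq[:first - 1] + seq[last:]


if __name__ == "__main__":
    toy = "MKTAYIAK"  # hypothetical peptide for demonstration
    print(substitute(toy, 3, "S"))   # single-residue substitution
    print(insert(toy, 4, "GG"))      # insertion of an adjacent pair
    print(delete(toy, 5, 6))         # deletion of an adjacent pair
```

The insertion and deletion examples each act on two adjacent residues, in line with the 
stated preference for changes made in adjacent pairs. 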

The derivatives or like molecules include single or multiple substitutions, deletions and/or 
additions of any component(s) naturally or artificially associated with the 
a-N-acetylglucosaminidase enzyme such as carbohydrate, lipid and/or other proteinaceous 
moieties. For example, the present invention extends to glycosylated and non-glycosylated 
forms of the molecule. All such molecules are encompassed by the expression "mutants", 
"derivatives", "fragments", "portions" and "like" molecules. These molecules may be 
active or non-active and may contain specific regions, such as a catalytic region. 
Particularly preferred derivative molecules include those with altered glycosylation 
patterns relative to the naturally occurring molecule. Even more particularly, the 
recombinant molecule is more highly glycosylated than the naturally occurring molecule. 
Such highly glycosylated derivatives may have improved take-up properties and enhanced 
half-lives. 

As indicated in the Examples, the molecular weights of purified human 
a-N-acetylglucosaminidase (i.e. 82 kDa and 77 kDa) and recombinant mammalian 
a-N-acetylglucosaminidase produced in CHO cells (i.e. 89 kDa and 79 kDa) are greater than 
the deduced molecular weight of the a-N-acetylglucosaminidase polypeptide set forth in 
SEQ ID NO:2 (i.e. 70 kDa), suggesting that the purified and recombinant polypeptides are 
post-translationally modified. The data presented in Example 8 indicate further that the 
recombinant a-N-acetylglucosaminidase enzyme produced in CHO cells, at least, is 
glycosylated and that the difference in molecular weight determined for the recombinant 
polypeptides and the polypeptide of SEQ ID NO:2 is due almost entirely to glycosylation 
of the recombinant polypeptide by CHO cells. As shown in Example 9, the glycosylated 
recombinant a-N-acetylglucosaminidase polypeptide exhibits enzymatic activity. 
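
The size of the difference referred to above follows directly from the figures quoted. 
The short calculation below subtracts the deduced molecular weight (70 kDa) from each 
observed value; it uses only numbers stated in this paragraph, and the dictionary labels 
are illustrative names rather than terms from the Examples. 

```python
DEDUCED_KDA = 70  # deduced from SEQ ID NO:2, as stated in the text

OBSERVED_KDA = {
    "purified human": (82, 77),
    "recombinant (CHO)": (89, 79),
}


def excess_over_deduced(observed: dict, deduced: int = DEDUCED_KDA) -> dict:
    """Observed minus deduced molecular weight for every polypeptide, in kDa."""
    return {
        source: tuple(mass - deduced for mass in masses)
        for source, masses in observed.items()
    }


EXCESS_KDA = excess_over_deduced(OBSERVED_KDA)

if __name__ == "__main__":
    for source, excess in EXCESS_KDA.items():
        print(source, excess)
```

On these figures the purified polypeptides exceed the deduced weight by 12 and 7 kDa, and 
the recombinant polypeptides by 19 and 9 kDa. 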

The present invention also extends to synthetic a-N-acetylglucosaminidase or like 
molecules when fused to other proteinaceous molecules. The latter may include another 
enzyme, reporter molecule, purification site or an amino acid sequence which facilitates 
transport of the molecule out of a cell, such as a signal sequence. 

The present invention extends further to post-translational modifications to the 
a-N-acetylglucosaminidase enzyme. The modifications may be made to the naturally occurring 
enzyme or following synthesis by recombinant techniques. The modifications may be at 
the structural level or at, for example, the electrochemical level such as modifying net 
charge or structural conformation of the enzyme. 

Such modification may be important to facilitate entry or penetration of the enzyme into 
selected tissues such as cartilage or across the blood-brain barrier, or to increase 
circulation half-life. 

Analogues of a-N-acetylglucosaminidase contemplated herein include, but are not limited 
to, modifications to side chains, incorporation of unnatural amino acids and/or their 
derivatives during peptide synthesis and the use of crosslinkers and other methods which 
impose conformational constraints on the enzyme. 

Examples of side chain modifications contemplated by the present invention include 
modifications of amino groups such as by reductive alkylation by reaction with an 
aldehyde followed by reduction with NaBH4; amidination with methylacetimidate; 
acylation with acetic anhydride; carbamoylation of amino groups with cyanate; 
trinitrobenzylation of amino groups with 2,4,6-trinitrobenzene sulphonic acid (TNBS); 
acylation of amino groups with succinic anhydride and tetrahydrophthalic anhydride; and 
pyridoxylation of lysine with pyridoxal-5'-phosphate followed by reduction with NaBH4. 

The guanidino group of arginine residues may be modified by the formation of 
heterocyclic condensation products with reagents such as 2,3-butanedione, phenylglyoxal 
and glyoxal. 

The carboxyl group may be modified by carbodiimide activation via O-acylisourea 
formation followed by subsequent derivatisation, for example, to a corresponding amide. 

Sulphydryl groups may be modified by methods such as carboxymethylation with 
iodoacetic acid or iodoacetamide; performic acid oxidation to cysteic acid; formation of 
mixed disulphides with other thiol compounds; reaction with maleimide, maleic 
anhydride or other substituted maleimide; formation of mercurial derivatives using 
4-chloromercuribenzoate, 4-chloromercuriphenylsulphonic acid, phenylmercury chloride, 
2-chloromercuri-4-nitrophenol and other mercurials; carbamoylation with cyanate at 
alkaline pH. 

Tryptophan residues may be modified by, for example, oxidation with 
N-bromosuccinimide or alkylation of the indole ring with 2-hydroxy-5-nitrobenzyl bromide 
or sulphenyl halides. Tyrosine residues, on the other hand, may be altered by nitration with 
tetranitromethane to form a 3-nitrotyrosine derivative. 

Modification of the imidazole ring of a histidine residue may be accomplished by 
alkylation with iodoacetic acid derivatives or N-carbethoxylation with 
diethylpyrocarbonate. 

Examples of incorporating unnatural amino acids and derivatives during peptide synthesis 
include, but are not limited to, use of norleucine, 4-amino butyric acid, 
4-amino-3-hydroxy-5-phenylpentanoic acid, 6-aminohexanoic acid, t-butylglycine, norvaline, 
phenylglycine, ornithine, sarcosine, 4-amino-3-hydroxy-6-methylheptanoic acid, 2-thienyl 
alanine and/or D-isomers of amino acids. Non-naturally occurring amino acids 
contemplated by the present invention are incorporated herein, as Table 3. 

Crosslinkers can be used, for example, to stabilise 3D conformations, using 
homo-bifunctional crosslinkers such as the bifunctional imido esters having (CH2)n spacer 
groups with n=1 to n=6, glutaraldehyde, N-hydroxysuccinimide esters and hetero-bifunctional 
reagents which usually contain an amino-reactive moiety such as N-hydroxysuccinimide 
and another group specific-reactive moiety such as maleimido or dithio moiety (SH) or 
carbodiimide (COOH). In addition, the enzyme could be conformationally constrained by, 
for example, incorporation of Cα and Nα-methylamino acids, introduction of double bonds 
between Cα and Cβ atoms of amino acids and the formation of cyclic peptides or analogues 
by introducing covalent bonds such as forming an amide bond between the N and C 
termini, between two side chains or between a side chain and the N or C terminus. 

Electrochemical modifications of a-N-acetylglucosaminidase include interaction with 
polylysine or polyethylene glycol or other agent which effects an overall change to the net 
charge of the enzyme. 

Advantageously, the recombinant a-N-acetylglucosaminidase is a biologically pure 
preparation, meaning that it has undergone some purification away from other proteins and/or 
non-proteinaceous material. The purity of the preparation may be represented as at least 
40% of the enzyme, preferably at least 60%, more preferably at least 75%, even more 
preferably at least 85% and still more preferably at least 95% relative to 
non-a-N-acetylglucosaminidase material as determined by weight, activity, amino acid homology 
or similarity, antibody reactivity or other convenient means. 
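
As a simple worked example of the purity levels recited above, the sketch below expresses 
purity by weight as a percentage of total material and reports the highest recited level 
that a preparation reaches. Determination by weight is only one of the measures mentioned 
in the text, and the function names and the sample figures are hypothetical. 

```python
# Purity levels recited in the text, in percent.
PURITY_TIERS = (40, 60, 75, 85, 95)


def purity_percent(enzyme_mg: float, total_mg: float) -> float:
    """Enzyme mass as a percentage of the total mass of the preparation."""
    if total_mg <= 0 or not 0 <= enzyme_mg <= total_mg:
        raise ValueError("masses must satisfy 0 <= enzyme_mg <= total_mg, total_mg > 0")
    return 100.0 * enzyme_mg / total_mg


def highest_tier(purity: float):
    """Highest recited purity level met, or None if below the lowest level."""
    met = [tier for tier in PURITY_TIERS if purity >= tier]
    return met[-1] if met else None


if __name__ == "__main__":
    purity = purity_percent(enzyme_mg=4.5, total_mg=5.0)  # hypothetical preparation
    print(purity, highest_tier(purity))
```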


Particularly preferred methods for the preparation and purification of recombinant a-N- 
acetylglucosaminidase are provided in Examples 7 and 8. 

Those skilled in the art will be aware of the means of purifying a synthetic or recombinant 
a-N-acetylglucosaminidase from several sources without undue experimentation and for 
expressing the degree of purity of such a purified preparation of the enzyme. 

The present invention further contemplates antibodies to a-N-acetylglucosaminidase and 
preferably synthetic a-N-acetylglucosaminidase or like molecule. The antibodies may be 
polyclonal or monoclonal, naturally occurring or synthetic (including recombinant, 
fragment or fusion forms). Such antibodies will be useful in developing immunoassays for 
a-N-acetylglucosaminidase and for identifying additional genetic sequences which are 
capable of expressing a-N-acetylglucosaminidase polypeptides or homologues, analogues 
or derivatives thereof. 

Both polyclonal and monoclonal antibodies are obtainable by immunisation with an 
appropriate synthetic or recombinant gene product, or epitope, or peptide fragment of a 
gene product, in particular an a-N-acetylglucosaminidase polypeptide or a homologue, 
analogue or derivative thereof. 

Alternatively, fragments of antibodies may be used, such as Fab fragments. The present 
invention extends further to encompass recombinant and synthetic antibodies and to 
antibody hybrids. A "synthetic antibody" is considered herein to include fragments and 
hybrids of antibodies. 


A further aspect of the present invention contemplates a method of screening for mutations 
or other aberrations in the a-N-acetylglucosaminidase gene in a human or animal patient. 
Such a method may be accomplished in a number of ways including isolating a source of 
DNA to be tested or mRNA therefrom and hybridising thereto a nucleic acid molecule as 
hereinbefore described. Generally, the nucleic acid is probe or primer size and polymerase 
chain reaction is a convenient means by which to analyse the RNA or DNA. Other suitable 
assays include the ligation chain reaction and the strand displacement amplification 
methods. The a-N-acetylglucosaminidase sequence can also be determined and compared 
to the naturally occurring sequence. Such methods may be useful in adults and children 
and may be adapted for a pre-natal test. The DNA to be tested includes a genomic sample 
carrying the a-N-acetylglucosaminidase gene, a cDNA clone and/or amplification product. 

In accordance with this aspect of the present invention there is provided a method for 
screening for aberrations in the a-N-acetylglucosaminidase gene including the absence of 
such a gene or a portion or a substantial portion thereof comprising isolating a sample of 
DNA or mRNA corresponding to a region of said DNA and contacting same with an 
oligonucleotide probe capable of hybridising to one or more complementary sequences 
within the a-N-acetylglucosaminidase gene and then detecting the hybridisation, the extent 
of hybridisation or the absence of hybridisation. 
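
The logic of detecting the presence or absence of hybridisation can be sketched in silico. 
The example below is a deliberately simplified model, not a laboratory protocol: it 
treats hybridisation as perfect Watson-Crick complementarity between a probe and one 
strand of a sample, and ignores stringency, mismatches and probe length requirements. 
All sequences are hypothetical toy strings, and the function names are illustrative. 

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Reverse complement of a DNA string written 5' to 3'."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


def probe_binding_sites(sample: str, probe: str) -> list[int]:
    """0-based start positions on `sample` where `probe` is perfectly complementary."""
    sample = sample.upper()
    target = reverse_complement(probe)
    sites = []
    start = sample.find(target)
    while start != -1:
        sites.append(start)
        start = sample.find(target, start + 1)  # allow overlapping sites
    return sites


if __name__ == "__main__":
    probe = reverse_complement("ATGGAGGCG")   # hypothetical probe
    intact = "GGATCCATGGAGGCGTTAAC"            # toy sample containing the target
    deleted = "GGATCCGTTAAC"                   # toy sample lacking the target
    print(probe, probe_binding_sites(intact, probe), probe_binding_sites(deleted, probe))
```

An empty result corresponds to the "absence of hybridisation" case, which in the method 
described above may indicate that the gene, or the probed portion of it, is missing. 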

Alternatively, the probe is a primer capable of directing amplification of one or more 
regions of said a-N-acetylglucosaminidase gene, and the amplification products and/or 
profile of amplification products is compared to that of an individual carrying the full 
gene or to a reference database. 

Conveniently, the amplification products are sequenced to determine the presence or 
absence of the full gene. 

The present invention extends to the use of any and all DNA-based or nucleic acid-based 
hybridisation and/or polymerase chain reaction formats as described herein, for the 
diagnosis of a disorder involving the a-N-acetylglucosaminidase gene in a human or 
animal patient. 

The present invention further extends to a method of treating patients suffering from 
a-N-acetylglucosaminidase deficiency, such as in MPS-IIIB, said method comprising 
administering to said patient an effective amount of a-N-acetylglucosaminidase or active 
like form thereof. 


Preferably, the a-N-acetylglucosaminidase is in recombinant form. Such a method is 
referred to as "enzyme therapy". Alternatively, gene therapy can be employed including 
introducing an active gene (i.e. a nucleic acid molecule as hereinbefore described) or 
parts of the gene or other sequences which facilitate expression of a naturally occurring 
a-N-acetylglucosaminidase gene. 

Administration of a-N-acetylglucosaminidase for enzyme therapy may be by oral, 
intravenous, suppository, intraperitoneal, intramuscular, intranasal, intradermal or 
subcutaneous administration or by infusion or implantation. The 
a-N-acetylglucosaminidase is preferably as hereinbefore described including active mutants or 
derivatives thereof and glycosylation variants thereof. Administration may also be by way 
of gene therapy including expression of the gene by inclusion of the gene in viral vectors 
which are introduced into the animal (e.g. human) host to be treated. Alternatively, the 
gene may be expressed in a bacterial host which is then introduced and becomes part of the 
bacterial flora in the animal to be treated. 

Still yet another aspect of the present invention is directed to a pharmaceutical composition 
comprising synthetic (e.g. recombinant) a-N-acetylglucosaminidase or like molecule, 
including active derivatives and fragments thereof, alone or in combination with other 
active molecules. Such other molecules may act synergistically with the enzyme or 
facilitate its entry to a target cell. The composition will also contain one or more 
pharmaceutically acceptable carriers and/or diluents. The composition may alternatively 
comprise a genetic component useful in gene therapy. 

The active ingredients of the pharmaceutical composition comprising the synthetic or 
recombinant a-N-acetylglucosaminidase or mutants or fragments or derivatives thereof are 
contemplated to exhibit excellent activity in treating patients with a deficiency in the 
enzyme when administered in an amount which depends on the particular case. The 
variation depends, for example, on the patient and the a-N-acetylglucosaminidase used. 
For example, from about 0.5 µg to about 20 mg of enzyme per animal body or, depending 
on the animal and other factors, per kilogram of body weight may be administered. Dosage 
regimens may be adjusted to provide the optimum therapeutic response. For example, several 
divided doses may be administered daily, weekly, monthly or in other suitable time 
intervals or the dose may be proportionally reduced as indicated by the exigencies of the 
situation. Accordingly, alternative dosages in the order of 1.0 µg to 15 mg, 2.0 µg to 10 
mg or 10 µg to 5 mg may be administered in a single dose or as part of multiple doses. The 
active compound may be administered in a convenient manner such as by the oral, 
intravenous (where water soluble), intramuscular, subcutaneous, intranasal, intradermal or 
suppository routes or implanting (e.g. using slow release molecules). Depending on the route 
of administration, the active ingredients which comprise a synthetic (e.g. recombinant) 
a-N-acetylglucosaminidase or fragments, derivatives or mutants thereof may be required to 
be coated in a material to protect same from the action of enzymes, acids and other natural 
conditions which may inactivate said ingredients. For example, the low lipophilicity of 
a-N-acetylglucosaminidase will allow it to be destroyed in the gastrointestinal tract by 
enzymes capable of cleaving peptide bonds and in the stomach by acid hydrolysis. In order 
to administer the enzyme by other than parenteral administration, the enzyme will be 
coated by, or administered with, a material to prevent its inactivation. For example, the 
enzyme may be administered in an adjuvant, co-administered with enzyme inhibitors or 
in liposomes. Adjuvant is used in its broadest sense and includes any immune stimulating 
compound such as interferon. Adjuvants contemplated herein include resorcinols, 
non-ionic surfactants such as polyoxyethylene oleyl ether and n-hexadecyl polyethylene ether. 
Conveniently, the adjuvant is Freund's Complete or Incomplete Adjuvant. Enzyme 
inhibitors include pancreatic trypsin inhibitor, diisopropylfluorophosphate (DFP) and 
trasylol. Liposomes include water-in-oil-in-water emulsions as well as conventional 
liposomes. 
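
The dosage ranges recited in the preceding paragraph mix microgram and milligram units. 
The sketch below merely places them on a common microgram scale and shows the arithmetic 
for a per-kilogram amount; it restates figures from the text and is not dosing guidance. 
The range labels, function names and the example amount and body weight are hypothetical. 

```python
UG_PER_MG = 1000.0

# Ranges from the text, converted to micrograms (lower bound, upper bound).
DOSE_RANGES_UG = {
    "broad": (0.5, 20 * UG_PER_MG),
    "alternative 1": (1.0, 15 * UG_PER_MG),
    "alternative 2": (2.0, 10 * UG_PER_MG),
    "alternative 3": (10.0, 5 * UG_PER_MG),
}


def ranges_containing(dose_ug: float) -> list[str]:
    """Names of the recited ranges within which a given amount falls."""
    return [name for name, (low, high) in DOSE_RANGES_UG.items() if low <= dose_ug <= high]


def total_dose_ug(per_kg_ug: float, body_weight_kg: float) -> float:
    """Total amount when a per-kilogram amount is scaled by body weight."""
    if per_kg_ug < 0 or body_weight_kg <= 0:
        raise ValueError("amount must be non-negative and weight positive")
    return per_kg_ug * body_weight_kg


if __name__ == "__main__":
    total = total_dose_ug(per_kg_ug=10.0, body_weight_kg=20.0)  # hypothetical values
    print(total, ranges_containing(total))
```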


The active compound may also be administered in dispersions prepared in glycerol, liquid 
polyethylene glycols, and/or mixtures thereof and in oils. Under ordinary conditions of 
storage and use, these preparations contain a preservative to prevent the growth of 
microorganisms. 

The pharmaceutical forms suitable for injectable use include sterile aqueous solutions 
(where water soluble) or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or dispersion. In all cases the form must be sterile 
and must be fluid to the extent that easy syringability exists. It must be stable under the 
conditions of manufacture and storage and must be preserved against the contaminating 
action of microorganisms such as bacteria and fungi. The carrier can be a solvent or 
dispersion medium containing, for example, water, ethanol, polyol (for example, glycerol, 
propylene glycol, and liquid polyethylene glycol, and the like), suitable mixtures thereof, 
and vegetable oils. The proper fluidity can be maintained, for example, by the use of a 
coating such as lecithin, by the maintenance of the required particle size in the case of 
dispersion and by the use of surfactants. The prevention of the action of microorganisms 
can be brought about by various antibacterial and antifungal agents, for example, parabens, 
chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In many cases, it will be 
preferable to include isotonic agents, for example, sugars or sodium chloride. Prolonged 
absorption of the injectable compositions can be brought about by the use in the 
compositions of agents delaying absorption, for example, aluminum monostearate and 
gelatin. 

Sterile injectable solutions are prepared by incorporating the active compound in the
required amount in the appropriate solvent with various of the other ingredients
enumerated above, as required, followed by filtered sterilization. Generally, dispersions
are prepared by incorporating the various sterilized active ingredient(s) into a sterile
vehicle which contains the basic dispersion medium and the required other ingredients
from those enumerated above. In the case of sterile powders for the preparation of sterile
injectable solutions, the preferred methods of preparation are vacuum drying and the
freeze-drying technique, which yield a powder of the active ingredient plus any additional
desired ingredient from a previously sterile-filtered solution thereof.

When the α-N-acetylglucosaminidase of the present invention is suitably protected as
described above, the composition may be orally administered, for example, with an inert
diluent or with an assimilable edible carrier, or it may be enclosed in a hard or soft shell
gelatin capsule, or it may be compressed into tablets, or it may be incorporated directly
with the food of the diet. For oral therapeutic administration, the active compound may be
incorporated with excipients and used in the form of ingestible tablets, buccal tablets,
troches, capsules, elixirs, suspensions, syrups, wafers, and the like. Such compositions and
preparations should contain at least 1% by weight of active compound. The percentage of
the compositions and preparations may, of course, be varied and may conveniently be
between about 5 and about 80% of the weight of the unit. The amount of active compound
in the vaccine compositions is such that a suitable dosage will be obtained. Preferred
compositions or preparations according to the present invention are prepared so that an
oral dosage unit form contains between about 0.5 µg and 20 mg of active compound.

The tablets, troches, pills, capsules and the like may also contain the following: a binder
such as gum tragacanth, acacia, corn starch or gelatin; excipients such as dicalcium
phosphate; a disintegrating agent such as corn starch, potato starch, alginic acid and the
like; a lubricant such as magnesium stearate; and a sweetening agent such as sucrose, lactose
or saccharin may be added or a flavoring agent such as peppermint, oil of wintergreen, or
cherry flavouring. When the dosage unit form is a capsule, it may contain, in addition to
materials of the above type, a liquid carrier. Various other materials may be present as
coatings or to otherwise modify the physical form of the dosage unit. For instance, tablets,
pills, or capsules may be coated with shellac, sugar or both. A syrup or elixir may contain
the active compound, sucrose as a sweetening agent, methyl and propylparabens as
preservatives, a dye and flavoring such as cherry or orange flavor. Of course, any material
used in preparing any dosage unit form should be pharmaceutically pure and substantially
non-toxic in the amounts employed. In addition, the active compound may be incorporated
into sustained-release preparations and formulations.

As used herein "pharmaceutically acceptable carriers and/or diluents" include any and all
solvents, dispersion media, aqueous solutions, coatings, antibacterial and antifungal agents,
isotonic and absorption delaying agents, and the like. The use of such media and agents for
pharmaceutical active substances is well known in the art. Except insofar as any
conventional media or agent is incompatible with the active ingredient, use thereof in the
pharmaceutical compositions is contemplated. Supplementary active ingredients can also
be incorporated into the compositions.

The present invention further relates to the use of α-N-acetylglucosaminidase or an active
fragment, mutant or derivative thereof in the manufacture of a medicament for the
treatment of patients suffering from a deficiency in the naturally occurring enzyme (e.g.
MPS-IIIB).

The present invention is further described with reference to the following non-limiting 
Examples. 



EXAMPLE 1
Purification of α-N-Acetylglucosaminidase

α-N-acetylglucosaminidase was purified according to the method described in Weber et
al. (1996). Enzyme was purified to homogeneity from human placenta. Evidence of
purity is shown following SDS/PAGE, which is represented in Figure 1. All samples were
reduced with dithiothreitol prior to electrophoresis.

EXAMPLE 2 
Characterisation of α-N-Acetylglucosaminidase

Results presented in Figure 1 show two polypeptides of about 82 kDa and 77 kDa
molecular weight, which correspond to α-N-acetylglucosaminidase polypeptides purified
from human placenta according to Example 1.



EXAMPLE 3
Amino Acid Sequence Determination

The N-terminal amino acid sequences of the 77 kDa and 82 kDa α-N-acetylglucosaminidase
polypeptides, in addition to the amino acid sequence of an internal CNBr cleavage product
of these peptides, were determined using the methods of Weber et al. (1996).

The amino acid sequences are shown in Table 4. 

TABLE 4

N-Terminal amino acid sequences (SEQ ID NO:4 and SEQ ID NO:5) and
CNBr peptide sequence (SEQ ID NO:6) determined from purified human
α-N-Acetylglucosaminidase

Polypeptide         Amino acid sequence
82 kDa              DEAREAAAVRALVARLLGPG
77 kDa              KPGLDTYSLGGGGAAX'VR
CNBr peptide        WRLLLTSAPSLX'TX'P

X': no residue could be identified for this position, indicating that this residue could be
phosphorylated or glycosylated.
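
The relationship between the peptides of Table 4 and the deduced protein (SEQ ID NO:2) can be checked with a short script. The following is only an illustrative sketch: it assumes the first 80 residues of SEQ ID NO:2 transcribed into one-letter code, treats X' (written here as X) as a wildcard, and uses a hypothetical helper named `locate`.

```python
import re

# Residues 1-80 of the deduced protein (SEQ ID NO:2), one-letter code.
PROTEIN_1_TO_80 = (
    "MEAVAVAAAVGVLLLAGAGGAAGDEAREAAAVRALVARLL"
    "GPGPAADFSVSVERALAAKPGLDTYSLGGGGAARVRVRGS"
)


def locate(peptide, protein):
    """Return the 1-based start of `peptide` in `protein`, or None.

    An 'X' in the peptide stands for an unidentified residue and
    matches any single amino acid.
    """
    pattern = "".join("." if aa == "X" else re.escape(aa) for aa in peptide)
    match = re.search(pattern, protein)
    return match.start() + 1 if match else None


print(locate("DEAREAAAVRALVARLLGPG", PROTEIN_1_TO_80))  # 24
print(locate("KPGLDTYSLGGGGAAXVR", PROTEIN_1_TO_80))    # 59
```

Under these assumptions the 82 kDa N-terminus begins at residue 24, directly after the 23 amino acid signal peptide, and the 77 kDa N-terminus begins at residue 59.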

EXAMPLE 4 
Cloning of α-N-Acetylglucosaminidase cDNA

Oligonucleotide probes were prepared based on the partial amino acid sequences obtained
for the purified α-N-acetylglucosaminidase polypeptides (Example 3). The probes were
subsequently used to screen a human peripheral blood leukocyte cDNA library. An
approximately 2.6 kbp cDNA clone was isolated encoding most of the sequence of human
α-N-acetylglucosaminidase (SEQ ID NO:1).

The remaining α-N-acetylglucosaminidase coding sequence was obtained from the
nucleotide sequence of the corresponding genomic gene (SEQ ID NO:3), isolated by
hybridisation to a human chromosome 17 library (Weber et al., 1996).

The complete open reading frame is 2232 nucleotides long and encodes a 743 (plus stop
codon) amino acid protein. The predicted molecular mass of the longest mature protein
(minus the 23 amino acid N-terminal signal peptide) is about 79,622 daltons.
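
The figures in this paragraph can be cross-checked against the CDS location given for SEQ ID NO:1 (102..2333). The sketch below is illustrative only; the variable names are not part of the specification, and the average residue mass is merely a sanity check on the stated 79,622 daltons.

```python
# Values taken from the specification.
CDS_START, CDS_END = 102, 2333      # CDS feature of SEQ ID NO:1
SIGNAL_PEPTIDE_LEN = 23             # residues removed from the precursor
MATURE_MASS_DA = 79622              # predicted mass of the longest mature protein

orf_nt = CDS_END - CDS_START + 1    # open reading frame length in nucleotides
codons = orf_nt // 3                # includes the stop codon
residues = codons - 1               # amino acids encoded
mature_residues = residues - SIGNAL_PEPTIDE_LEN

print(orf_nt, residues, mature_residues)            # 2232 743 720
print(round(MATURE_MASS_DA / mature_residues, 1))   # 110.6 Da per residue
```

An average of roughly 110 Da per residue is typical for proteins, so the stated mass is consistent with a 720-residue unglycosylated polypeptide.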

The amino acid sequence of α-N-acetylglucosaminidase is shown in SEQ ID NO:2. The
deduced molecular weight of the desired amino acid sequence of
α-N-acetylglucosaminidase is approximately 70 kDa. The probable site of signal peptide
peptidase cleavage is between amino acids 23 and 24. There are seven potential
N-glycosylation sites in the sequence.
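
Potential N-glycosylation sites are conventionally recognised by the motif Asn-X-Ser/Thr, where X is any residue other than proline. As a minimal sketch, assuming that motif definition and using only residues 257 to 276 of SEQ ID NO:2 as input, the first two of the seven listed sites can be recovered as follows (the function name `find_sequons` is hypothetical).

```python
import re


def find_sequons(seq, offset=1):
    """Return 1-based positions of Asn residues in N-X-S/T motifs (X != Pro).

    `offset` is the sequence position of the first residue of `seq`.
    A lookahead is used so that overlapping motifs are all reported.
    """
    return [m.start() + offset for m in re.finditer(r"N(?=[^P][ST])", seq)]


# Residues 257-276 of SEQ ID NO:2 in one-letter code.
FRAGMENT_257_276 = "FPQVNVTKMGSWGHFNCSYS"

print(find_sequons(FRAGMENT_257_276, offset=257))  # [261, 272]
```

Positions 261 and 272 agree with the first two potentially glycosylated Asn sites recorded in the feature table of SEQ ID NO:2.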

The nucleotide sequence of the corresponding α-N-acetylglucosaminidase genomic gene
(SEQ ID NO:3) comprises 10380 bp including 889 bp of 5' upstream sequence
corresponding to at least a part of the α-N-acetylglucosaminidase promoter sequence, in
addition to the nucleotide sequences of introns I, II, III, IV and V, in addition to 1326 bp of
3'-untranslated sequence.
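
The exon coordinates recorded in the feature table of SEQ ID NO:3 can be compared with the open reading frame length stated above. The following sketch simply sums the listed exon spans and derives the intervening intron lengths; the dictionary layout is an illustrative choice, not part of the sequence listing.

```python
# Exon locations from the feature table of SEQ ID NO:3 (1-based, inclusive).
EXONS = {
    1: (990, 1372),
    2: (2115, 2262),
    3: (3056, 3202),
    4: (3387, 3472),
    5: (5667, 5923),
    6: (7745, 8955),
}

exon_lengths = {n: end - start + 1 for n, (start, end) in EXONS.items()}

# Each intron spans the bases between the end of one exon and the start of the next.
intron_lengths = [
    EXONS[n + 1][0] - EXONS[n][1] - 1 for n in range(1, len(EXONS))
]

print(exon_lengths)                # {1: 383, 2: 148, 3: 147, 4: 86, 5: 257, 6: 1211}
print(sum(exon_lengths.values()))  # 2232
print(intron_lengths)              # [742, 793, 184, 2194, 1821]
```

The listed spans sum to 2232 bp, the length of the open reading frame, which suggests that the feature table records the coding portion of each exon.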

EXAMPLE 5
Construction of an expression vector comprising the
α-N-Acetylglucosaminidase cDNA sequence

The cDNA insert of λ clone pbi 33, containing bases 107 to 2575 of the
α-N-acetylglucosaminidase cDNA, was excised with EcoRI and subcloned into pBluescript II
SK- (Stratagene). A 178 bp XmaI fragment (bases 1 to 178 of the
α-N-acetylglucosaminidase cDNA) from cosmid sub-clone 6.3, containing the start codon, was
cloned into the pBluescript subclone to produce a full-length cDNA sequence in addition
to 101 bp of 5' non-translated sequence as well as 245 bp of 3' non-translated region
including the polyadenylation site, the polyA tail and linker DNA. The full-length cDNA
was directionally cloned into the pcDNA3 expression vector (Invitrogen) via the EcoRI and
BamHI sites.

EXAMPLE 6 
Expression of recombinant α-N-acetylglucosaminidase

Chinese Hamster Ovary (CHO) cells were transfected with the expression vector using the
DOTAP transfection reagent (Boehringer Mannheim) according to the manufacturer's
instructions. Cells were grown in Ham's F12 medium, 10% (v/v) fetal calf serum,
penicillin and streptomycin sulfate at 100 µg/ml each. Cells were grown in nonselective
medium for 48 h and then incubated in medium containing 750 µg/ml G418 sulfate
(Geneticin) until resistant colonies emerged.

Single cell clones were grown up and 26 of them were tested for expression of recombinant
α-N-acetylglucosaminidase with a fluorogenic α-N-acetylglucosaminidase substrate (i.e.
N-acetylglucosamine α-linked to 4-methylumbelliferone).

EXAMPLE 7 

Large scale α-N-acetylglucosaminidase production

2 g of Cytodex 2 microcarrier beads were swollen in 250 ml of PBS for 3 h at 37°C with
three changes of PBS and then autoclaved for 15 min at 110°C (wet cycle). The beads
were then rinsed with sterile growth medium (Coons/DMEM, 10% v/v fetal calf serum,
penicillin and streptomycin sulfate at 100 µg/ml each and 0.1% w/v Pluronic F68) and
transferred into a Techne stirrer culture flask. The microcarrier beads were inoculated with
seven confluent 175 flasks of the cell clone showing the highest expression of recombinant
α-N-acetylglucosaminidase. Growth medium was added up to 200 ml and the culture
incubated with a stirrer speed of 20 rpm to achieve an even distribution of cells on the
beads. The cells were allowed to attach to the beads for 16 h at low speed, then medium
was added up to 500 ml and the stirrer speed increased to 30 rpm. After a growth phase
of 48 to 72 h, with daily aerating to allow gas exchange, the beads were completely covered
with cells and the medium was exchanged for production medium (Coons/DMEM, no fetal
calf serum, penicillin and streptomycin sulfate at 100 µg/ml each, 0.1% w/v Pluronic F68
and 5 mM NH4Cl). The glucose concentration was monitored daily and the medium was
replaced when glucose fell below 5 mM, every 2-3 days. The harvested medium contained
approximately 2 mg α-N-acetylglucosaminidase protein per dm³ of production medium.



EXAMPLE 8 

Purification of recombinant α-N-acetylglucosaminidase

Production medium was dialysed against 50 mM NaAc pH 5.5 and loaded onto a
heparin-agarose column equilibrated in the same buffer. After washing with NaAc buffer and
NaAc/50 mM NaCl the column was eluted with 75 mM NaCl in NaAc buffer. The eluate
was dialysed against 20 mM Tris/HCl pH 7.5, loaded onto a DEAE Sephacel column,
washed with 25 mM NaCl in 20 mM Tris/HCl and then eluted with 50 and 75 mM NaCl
in 20 mM Tris/HCl respectively.



SDS-PAGE of the two eluates showed two polypeptide bands associated with enzyme
activity with apparent molecular weights of 79 and 89 kDa. The smaller
α-N-acetylglucosaminidase was eluted predominantly in the 50 mM NaCl fraction whereas the
89 kDa α-N-acetylglucosaminidase polypeptide was enriched in the 75 mM NaCl fraction
(Fig. 2).

The difference in apparent molecular weight of the recombinant
α-N-acetylglucosaminidase polypeptides is due to the presence of additional carbohydrate side
chains, since a digest with PNGase F, which cleaves off N-glycosylation groups, reduced
both the 79 kDa and 89 kDa polypeptides to a polypeptide band having an apparent
molecular weight of about 70 kDa (Fig. 2), which corresponds to the approximate
molecular weight deduced from primary amino acid sequence data (SEQ ID NO:2).
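
As a rough illustration of what the PNGase F result implies, the carbohydrate contribution to each band can be estimated by subtraction. This is only a sketch: SDS-PAGE apparent molecular weights are approximate, and the function name `glycan_share` is hypothetical.

```python
DEGLYCOSYLATED_KDA = 70  # apparent mass after PNGase F digestion


def glycan_share(apparent_kda, core_kda=DEGLYCOSYLATED_KDA):
    """Return (carbohydrate mass in kDa, carbohydrate as % of apparent mass)."""
    glycan = apparent_kda - core_kda
    return glycan, round(100 * glycan / apparent_kda, 1)


for band in (79, 89):
    print(band, glycan_share(band))
# 79 (9, 11.4)
# 89 (19, 21.3)
```

On these figures N-linked carbohydrate would account for roughly 9 kDa of the smaller form and 19 kDa of the larger form.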


EXAMPLE 9 

Characteristics of recombinant α-N-acetylglucosaminidase

No differences were observed between the enzyme activities of the 79 and 89 kDa
recombinant α-N-acetylglucosaminidase polypeptides produced in CHO cells according
to Examples 7 and 8. With the fluorogenic substrate N-acetylglucosamine α-linked to
4-methylumbelliferone (4-MU), the enzyme has a pH optimum of 4.6 with a Km of
5.34 mM and a Vmax of 3.97 x 10^ pmol/min/mg. Towards a ³H-labelled disaccharide
substrate it showed a pH optimum of 4.1 with a Km of 0.0166 mM and a Vmax of 4.48 x 10*
pmol/min/mg.
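
The practical meaning of the two Km values can be illustrated with the Michaelis-Menten rate law, v/Vmax = [S]/(Km + [S]). The sketch below assumes simple Michaelis-Menten behaviour for both substrates and works only with the fraction of Vmax, so that no Vmax value needs to be supplied; the substrate concentrations are arbitrary examples.

```python
KM_4MU_MM = 5.34            # Km towards the 4-methylumbelliferyl substrate (mM)
KM_DISACCHARIDE_MM = 0.0166  # Km towards the labelled disaccharide substrate (mM)


def fraction_of_vmax(substrate_mm, km_mm):
    """Michaelis-Menten rate expressed as a fraction of Vmax."""
    return substrate_mm / (km_mm + substrate_mm)


for s in (0.01, 0.1, 1.0, 5.34):  # example substrate concentrations in mM
    print(
        f"[S] = {s:5.2f} mM: "
        f"4-MU substrate {fraction_of_vmax(s, KM_4MU_MM):.3f}, "
        f"disaccharide {fraction_of_vmax(s, KM_DISACCHARIDE_MM):.3f}"
    )

print(round(KM_4MU_MM / KM_DISACCHARIDE_MM))  # ratio of the two Km values
```

By definition the rate is half of Vmax when the substrate concentration equals Km, so the enzyme approaches saturation with the disaccharide at concentrations several hundred-fold lower than with the fluorogenic substrate.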


EXAMPLE 10 
Mutational analysis of Sanfilippo B patients 

Genomic DNA is isolated from cultivated skin fibroblasts of patients by extraction with
phenol/chloroform and used to amplify the eight exons and adjacent intronic sequences
individually by PCR.

Primer sequences used in the amplification reaction are readily determined from the
nucleotide sequences of the α-N-acetylglucosaminidase cDNA and genomic clones, set
forth in SEQ ID NO:1 or SEQ ID NO:3. Amplification conditions are also readily
determined without undue experimentation. Procedures for the design of PCR primers and
amplification conditions are described in detail, for example, by McPherson et al. (1991).
Differences in the primary sequence can be identified by separating the PCR products on
a polyacrylamide gel under non-denaturing conditions (SSCP gels). Base changes,
insertions and deletions will lead to a different band pattern compared with the wildtype
in most of the cases, which can be visualised either by autoradiography of the gel after
labelling the DNA during the PCR or by staining unlabelled DNA in the gel with silver.
PCR products which show a different band pattern are sequenced to identify the change.
Other patient samples can be tested for the mutations and polymorphisms that were found by
hybridisation with wildtype- and mutation-specific oligonucleotides (ASO).
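
One way to begin choosing primer positions is to compute, for each exon listed in the feature table of SEQ ID NO:3, a window that also covers some adjacent intronic sequence. The sketch below is purely illustrative: the 50 bp flank is an arbitrary assumption rather than a value from this specification, and `amplicon_window` is a hypothetical helper.

```python
GENOMIC_LENGTH = 10380  # length of SEQ ID NO:3
FLANK_BP = 50           # assumed amount of flanking intronic sequence

# Exon locations from the feature table of SEQ ID NO:3 (1-based, inclusive).
EXONS = {
    1: (990, 1372),
    2: (2115, 2262),
    3: (3056, 3202),
    4: (3387, 3472),
    5: (5667, 5923),
    6: (7745, 8955),
}


def amplicon_window(start, end, flank=FLANK_BP, seq_len=GENOMIC_LENGTH):
    """Return the (start, end) of an exon extended by `flank` bases on each side,
    clipped to the bounds of the genomic sequence."""
    return max(1, start - flank), min(seq_len, end + flank)


for number, (start, end) in EXONS.items():
    w_start, w_end = amplicon_window(start, end)
    print(f"exon {number}: {w_start}..{w_end} ({w_end - w_start + 1} bp)")
```

With these assumptions the window for exon 6 is over 1.3 kb, so in practice a region of that size might be covered by several overlapping amplicons rather than a single PCR product.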

Those skilled in the art will appreciate that the invention described herein is susceptible to
variations and modifications other than those specifically described. It is to be understood
that the invention includes all such variations and modifications. The invention also
includes all of the steps, features, compositions and compounds referred to or indicated in
this specification, individually or collectively, and any and all combinations of any two or
more of said steps or features.

(1) GENERAL INFORMATION:

(i) APPLICANT: WOMEN'S AND CHILDREN'S HOSPITAL
(US ONLY: HOPWOOD JJ, WEBER B, SCOTT HS, BLANCH LC, ANSON D)

(ii) TITLE OF INVENTION: SYNTHETIC MAMMALIAN α-N-ACETYLGLUCOSAMINIDASE AND
GENETIC SEQUENCES ENCODING SAME

(iii) NUMBER OF SEQUENCES: 6

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: DAVIES COLLISON CAVE
(B) STREET: 1 LITTLE COLLINS STREET
(C) CITY: MELBOURNE
(D) STATE: VICTORIA
(E) COUNTRY: AUSTRALIA
(F) ZIP: 3000

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: AU PN6748/95
(B) FILING DATE: 23-NOV-1995

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: PCT INTERNATIONAL
(B) FILING DATE: 22-NOV-1996

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: HUGHES, DR E JOHN L
(C) REFERENCE/DOCKET NUMBER: MRO/EJH/JMC

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: +61 3 9254 2777
(B) TELEFAX: +61 3 9254 2770

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2575 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Homo sapiens
(F) TISSUE TYPE: Peripheral Blood
(G) CELL TYPE: Leukocyte

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 102..2333

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

CCCGGGCTTA GCCTTCGGGT CCACGTGGCC GGAGGCCGGC AGCTGATTGG ACGCGGGCCG 60

CCCCACCCCC TGGCCGTCGC GGGACCCGCA GGACTGAGAC C ATG GAG GCG GTG 113
Met Glu Ala Val
1

GCG GTG GCC GCG GCG GTG GGG GTC CTT CTC CTG GCC GGG GCC GGG GGC 161
Ala Val Ala Ala Ala Val Gly Val Leu Leu Leu Ala Gly Ala Gly Gly
5 10 15 20

GCG GCA GGC GAC GAG GCC CGG GAG GCG GCG GCC GTG CGG GCG CTC GTG 209
Ala Ala Gly Asp Glu Ala Arg Glu Ala Ala Ala Val Arg Ala Leu Val
25 30 35

GCC CGG CTG CTG GGG CCA GGC CCC GCG GCC GAC TTC TCC GTG TCG GTG 257
Ala Arg Leu Leu Gly Pro Gly Pro Ala Ala Asp Phe Ser Val Ser Val
40 45 50

GAG CGC GCT CTG GCT GCC AAG CCG GGC TTG GAC ACC TAC AGC CTG GGC 305
Glu Arg Ala Leu Ala Ala Lys Pro Gly Leu Asp Thr Tyr Ser Leu Gly
55 60 65

GGC GGC GGC GCG GCG CGC GTG CGG GTG CGC GGC TCC ACG GGC GTG GCG 353
Gly Gly Gly Ala Ala Arg Val Arg Val Arg Gly Ser Thr Gly Val Ala
70 75 80

GCC GCC GCG GGG CTG CAC CGC TAC CTG CGC GAC TTC TGT GGC TGC CAC 401
Ala Ala Ala Gly Leu His Arg Tyr Leu Arg Asp Phe Cys Gly Cys His
85 90 95 100



WO 97/19177 PCT/AU96/00747

GTG GCC TGG TCC GGC TCT CAG CTG CGC CTG CCG CGG CCA CTG CCA GCC 449
Val Ala Trp Ser Gly Ser Gln Leu Arg Leu Pro Arg Pro Leu Pro Ala
105 110 115

GTG CCG GGG GAG CTG ACC GAG GCC ACG CCC AAC AGG TAC CGC TAT TAC 497
Val Pro Gly Glu Leu Thr Glu Ala Thr Pro Asn Arg Tyr Arg Tyr Tyr
120 125 130

CAG AAT GTG TGC ACG CAA AGC TAC TCC TTC GTG TGG TGG GAC TGG GCC 545
Gln Asn Val Cys Thr Gln Ser Tyr Ser Phe Val Trp Trp Asp Trp Ala
135 140 145

CGC TGG GAG CGA GAG ATA GAC TGG ATG GCG CTG AAT GGC ATC AAC CTG 593
Arg Trp Glu Arg Glu Ile Asp Trp Met Ala Leu Asn Gly Ile Asn Leu
150 155 160

GCA CTG GCC TGG AGC GGC CAG GAG GCC ATC TGG CAG CGG GTG TAC CTG 641
Ala Leu Ala Trp Ser Gly Gln Glu Ala Ile Trp Gln Arg Val Tyr Leu
165 170 175 180

GCC TTG GGC CTG ACC CAG GCA GAG ATC AAT GAG TTC TTT ACT GGT CCT 689
Ala Leu Gly Leu Thr Gln Ala Glu Ile Asn Glu Phe Phe Thr Gly Pro
185 190 195

GCC TTC CTG GCC TGG GGG CGA ATG GGC AAC CTG CAC ACC TGG GAT GGC 737
Ala Phe Leu Ala Trp Gly Arg Met Gly Asn Leu His Thr Trp Asp Gly
200 205 210

CCC CTG CCC CCC TCC TGG CAC ATC AAG CAG CTT TAC CTG CAG CAC CGG 785
Pro Leu Pro Pro Ser Trp His Ile Lys Gln Leu Tyr Leu Gln His Arg
215 220 225

GTC CTG GAC CAG ATG CGC TCC TTC GGC ATG ACC CCA GTG CTG CCT GCA 833
Val Leu Asp Gln Met Arg Ser Phe Gly Met Thr Pro Val Leu Pro Ala
230 235 240

TTC GCG GGG CAT GTT CCC GAG GCT GTC ACC AGG GTG TTC CCT CAG GTC 881
Phe Ala Gly His Val Pro Glu Ala Val Thr Arg Val Phe Pro Gln Val
245 250 255 260

AAT GTC ACG AAG ATG GGC AGT TGG GGC CAC TTT AAC TGT TCC TAC TCC 929
Asn Val Thr Lys Met Gly Ser Trp Gly His Phe Asn Cys Ser Tyr Ser
265 270 275

TGC TCC TTC CTT CTG GCT CCG GAA GAC CCC ATA TTC CCC ATC ATC GGG 977
Cys Ser Phe Leu Leu Ala Pro Glu Asp Pro Ile Phe Pro Ile Ile Gly
280 285 290

AGC CTC TTC CTG CGA GAG CTG ATC AAA GAG TTT GGC ACA GAC CAC ATC 1025
Ser Leu Phe Leu Arg Glu Leu Ile Lys Glu Phe Gly Thr Asp His Ile
295 300 305

TAT GGG GCC GAC ACT TTC AAT GAG ATG CAG CCA CCT TCC TCA GAG CCC 1073
Tyr Gly Ala Asp Thr Phe Asn Glu Met Gln Pro Pro Ser Ser Glu Pro
310 315 320

TCC TAC CTT GCC GCA GCC ACC ACT GCC GTC TAT GAG GCC ATG ACT GCA 1121
Ser Tyr Leu Ala Ala Ala Thr Thr Ala Val Tyr Glu Ala Met Thr Ala
325 330 335 340

GTG GAT ACT GAG GCT GTG TGG CTG CTC CAA GGC TGG CTC TTC CAG CAC 1169
Val Asp Thr Glu Ala Val Trp Leu Leu Gln Gly Trp Leu Phe Gln His
345 350 355

CAG CCG CAG TTC TGG GGG CCC GCC CAG ATC AGG GCT GTG CTG GGA GCT 1217
Gln Pro Gln Phe Trp Gly Pro Ala Gln Ile Arg Ala Val Leu Gly Ala
360 365 370

GTG CCC CGT GGC CGC CTC CTG GTT CTG GAC CTG TTT GCT GAG AGC CAG 1265
Val Pro Arg Gly Arg Leu Leu Val Leu Asp Leu Phe Ala Glu Ser Gln
375 380 385

CCT GTG TAT ACC CGC ACT GCC TCC TTC CAG GGC CAG CCC TTC ATC TGG 1313
Pro Val Tyr Thr Arg Thr Ala Ser Phe Gln Gly Gln Pro Phe Ile Trp
390 395 400

TGC ATG CTG CAC AAC TTT GGG GGA AAC CAT GGT CTT TTT GGA GCC CTA 1361
Cys Met Leu His Asn Phe Gly Gly Asn His Gly Leu Phe Gly Ala Leu
405 410 415 420

GAG GCT GTG AAC GGA GGC CCA GAA GCT GCC CGC CTC TTC CCC AAC TCC 1409
Glu Ala Val Asn Gly Gly Pro Glu Ala Ala Arg Leu Phe Pro Asn Ser
425 430 435

ACC ATG GTA GGC ACG GGC ATG GCC CCC GAG GGC ATC AGC CAG AAC GAA 1457
Thr Met Val Gly Thr Gly Met Ala Pro Glu Gly Ile Ser Gln Asn Glu
440 445 450

GTG GTC TAT TCC CTC ATG GCT GAG CTG GGC TGG CGA AAG GAC CCA GTC 1505
Val Val Tyr Ser Leu Met Ala Glu Leu Gly Trp Arg Lys Asp Pro Val
455 460 465

CCA GAT TTG GCA GCC TGG GTG ACC AGC TTT GCC GCC CGG CGG TAT GGG 1553
Pro Asp Leu Ala Ala Trp Val Thr Ser Phe Ala Ala Arg Arg Tyr Gly
470 475 480

GTC TCC CAC CCG GAC GCA GGG GCA GCG TGG AGG CTA CTG CTC CGG AGT 1601
Val Ser His Pro Asp Ala Gly Ala Ala Trp Arg Leu Leu Leu Arg Ser
485 490 495 500

GTG TAC AAC TGC TCC GGG GAG GCC TGC AGG GGC CAC AAT CGT AGC CCG 1649
Val Tyr Asn Cys Ser Gly Glu Ala Cys Arg Gly His Asn Arg Ser Pro
505 510 515

CTG GTC AGG CGG CCG TCC CTA CAG ATG AAT ACC AGC ATC TGG TAC AAC 1697
Leu Val Arg Arg Pro Ser Leu Gln Met Asn Thr Ser Ile Trp Tyr Asn
520 525 530

CGA TCT GAT GTG TTT GAG GCC TGG CGG CTG CTG CTC ACA TCT GCT CCC 1745
Arg Ser Asp Val Phe Glu Ala Trp Arg Leu Leu Leu Thr Ser Ala Pro
535 540 545

TCC CTG GCC ACC AGC CCC GCC TTC CGC TAC GAC CTG CTG GAC CTC ACT 1793
Ser Leu Ala Thr Ser Pro Ala Phe Arg Tyr Asp Leu Leu Asp Leu Thr
550 555 560

CGG CAG GCA GTG CAG GAG CTG GTC AGC TTG TAC TAT GAG GAG GCA AGA 1841
Arg Gln Ala Val Gln Glu Leu Val Ser Leu Tyr Tyr Glu Glu Ala Arg
565 570 575 580

AGC GCC TAC CTG AGC AAG GAG CTG GCC TCC CTG TTG AGG GCT GGA GGC 1889
Ser Ala Tyr Leu Ser Lys Glu Leu Ala Ser Leu Leu Arg Ala Gly Gly
585 590 595

GTC CTG GCC TAT GAG CTG CTG CCG GCA CTG GAC GAG GTG CTG GCT AGT 1937
Val Leu Ala Tyr Glu Leu Leu Pro Ala Leu Asp Glu Val Leu Ala Ser
600 605 610

GAC AGC CGC TTC TTG CTG GGC AGC TGG CTA GAG CAG GCC CGA GCA GCG 1985
Asp Ser Arg Phe Leu Leu Gly Ser Trp Leu Glu Gln Ala Arg Ala Ala
615 620 625

GCA GTC AGT GAG GCC GAG GCC GAT TTC TAC GAG CAG AAC AGC CGC TAC 2033
Ala Val Ser Glu Ala Glu Ala Asp Phe Tyr Glu Gln Asn Ser Arg Tyr
630 635 640

CAG CTG ACC TTG TGG GGG CCA GAA GGC AAC ATC CTG GAC TAT GCC AAC 2081
Gln Leu Thr Leu Trp Gly Pro Glu Gly Asn Ile Leu Asp Tyr Ala Asn
645 650 655 660

AAG CAG CTG GCG GGG TTG GTG GCC AAC TAC TAC ACC CCT CGC TGG CGG 2129
Lys Gln Leu Ala Gly Leu Val Ala Asn Tyr Tyr Thr Pro Arg Trp Arg
665 670 675

CTT TTC CTG GAG GCG CTG GTT GAC AGT GTG GCC CAG GGC ATC CCT TTC 2177
Leu Phe Leu Glu Ala Leu Val Asp Ser Val Ala Gln Gly Ile Pro Phe
680 685 690

CAA CAG CAC CAG TTT GAC AAA AAT GTC TTC CAA CTG GAG CAG GCC TTC 2225
Gln Gln His Gln Phe Asp Lys Asn Val Phe Gln Leu Glu Gln Ala Phe
695 700 705

GTT CTC AGC AAG CAG AGG TAC CCC AGC CAG CCG CGA GGA GAC ACT GTG 2273
Val Leu Ser Lys Gln Arg Tyr Pro Ser Gln Pro Arg Gly Asp Thr Val
710 715 720

GAC CTG GCC AAG AAG ATC TTC CTC AAA TAT TAC CCC GGC TGG GTG GCC 2321
Asp Leu Ala Lys Lys Ile Phe Leu Lys Tyr Tyr Pro Gly Trp Val Ala
725 730 735 740

GGC TCT TGG TGATAGATTC GCCACCACTG GGCCTTGTTT TCCGCTAATT 2370
Gly Ser Trp

CCAGGGCAGA TTCCAGGGCC CAGAGCTGGA CAGACATCAC AGGATAACCC AGGCCTGGGA 2430

GGAGGCCCCA CGGCCTGCTG GTGGGGTCTG ACCTGGGGGG ATTGGAGGGA AATGACCTGC 2490

CCTCCACCAC CACCCAAAGT GTGGGATTAA AGTACTGTTT TCTTTCCACT TAAAAAAAAA 2550

AAAAAAGTCG AGCGGCCGCG AATTC 2575



(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 743 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(ix) FEATURE:
(A) NAME/KEY: Potentially-glycosylated Asn site
(B) LOCATION: 261

(ix) FEATURE:
(A) NAME/KEY: Potentially-glycosylated Asn site
(B) LOCATION: 272

(ix) FEATURE:
(A) NAME/KEY: Potentially-glycosylated Asn site
(B) LOCATION: 435

(ix) FEATURE:
(A) NAME/KEY: Potentially-glycosylated Asn site
(B) LOCATION: 503

(ix) FEATURE:
(A) NAME/KEY: Potentially-glycosylated Asn site
(B) LOCATION: 513

(ix) FEATURE:
(A) NAME/KEY: Potentially-glycosylated Asn site
(B) LOCATION: 526

(ix) FEATURE:
(A) NAME/KEY: Potentially-glycosylated Asn site
(B) LOCATION: 532

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:



Met Glu Ala Val Ala Val Ala Ala Ala Val Gly Val Leu Leu Leu Ala
1 5 10 15

Gly Ala Gly Gly Ala Ala Gly Asp Glu Ala Arg Glu Ala Ala Ala Val
20 25 30

Arg Ala Leu Val Ala Arg Leu Leu Gly Pro Gly Pro Ala Ala Asp Phe
35 40 45

Ser Val Ser Val Glu Arg Ala Leu Ala Ala Lys Pro Gly Leu Asp Thr
50 55 60

Tyr Ser Leu Gly Gly Gly Gly Ala Ala Arg Val Arg Val Arg Gly Ser
65 70 75 80

Thr Gly Val Ala Ala Ala Ala Gly Leu His Arg Tyr Leu Arg Asp Phe
85 90 95

Cys Gly Cys His Val Ala Trp Ser Gly Ser Gln Leu Arg Leu Pro Arg
100 105 110

Pro Leu Pro Ala Val Pro Gly Glu Leu Thr Glu Ala Thr Pro Asn Arg
115 120 125

Tyr Arg Tyr Tyr Gln Asn Val Cys Thr Gln Ser Tyr Ser Phe Val Trp
130 135 140

Trp Asp Trp Ala Arg Trp Glu Arg Glu Ile Asp Trp Met Ala Leu Asn
145 150 155 160

Gly Ile Asn Leu Ala Leu Ala Trp Ser Gly Gln Glu Ala Ile Trp Gln
165 170 175

Arg Val Tyr Leu Ala Leu Gly Leu Thr Gln Ala Glu Ile Asn Glu Phe
180 185 190

Phe Thr Gly Pro Ala Phe Leu Ala Trp Gly Arg Met Gly Asn Leu His
195 200 205

Thr Trp Asp Gly Pro Leu Pro Pro Ser Trp His Ile Lys Gln Leu Tyr
210 215 220

Leu Gln His Arg Val Leu Asp Gln Met Arg Ser Phe Gly Met Thr Pro
225 230 235 240

Val Leu Pro Ala Phe Ala Gly His Val Pro Glu Ala Val Thr Arg Val
245 250 255

Phe Pro Gln Val Asn Val Thr Lys Met Gly Ser Trp Gly His Phe Asn
260 265 270

Cys Ser Tyr Ser Cys Ser Phe Leu Leu Ala Pro Glu Asp Pro Ile Phe
275 280 285

Pro Ile Ile Gly Ser Leu Phe Leu Arg Glu Leu Ile Lys Glu Phe Gly
290 295 300

Thr Asp His Ile Tyr Gly Ala Asp Thr Phe Asn Glu Met Gln Pro Pro
305 310 315 320

Ser Ser Glu Pro Ser Tyr Leu Ala Ala Ala Thr Thr Ala Val Tyr Glu
325 330 335

Ala Met Thr Ala Val Asp Thr Glu Ala Val Trp Leu Leu Gln Gly Trp
340 345 350

Leu Phe Gln His Gln Pro Gln Phe Trp Gly Pro Ala Gln Ile Arg Ala
355 360 365

Val Leu Gly Ala Val Pro Arg Gly Arg Leu Leu Val Leu Asp Leu Phe
370 375 380

Ala Glu Ser Gln Pro Val Tyr Thr Arg Thr Ala Ser Phe Gln Gly Gln
385 390 395 400

Pro Phe Ile Trp Cys Met Leu His Asn Phe Gly Gly Asn His Gly Leu
405 410 415

Phe Gly Ala Leu Glu Ala Val Asn Gly Gly Pro Glu Ala Ala Arg Leu
420 425 430

Phe Pro Asn Ser Thr Met Val Gly Thr Gly Met Ala Pro Glu Gly Ile
435 440 445

Ser Gln Asn Glu Val Val Tyr Ser Leu Met Ala Glu Leu Gly Trp Arg
450 455 460

Lys Asp Pro Val Pro Asp Leu Ala Ala Trp Val Thr Ser Phe Ala Ala
465 470 475 480

Arg Arg Tyr Gly Val Ser His Pro Asp Ala Gly Ala Ala Trp Arg Leu
485 490 495

Leu Leu Arg Ser Val Tyr Asn Cys Ser Gly Glu Ala Cys Arg Gly His
500 505 510

Asn Arg Ser Pro Leu Val Arg Arg Pro Ser Leu Gln Met Asn Thr Ser
515 520 525

Ile Trp Tyr Asn Arg Ser Asp Val Phe Glu Ala Trp Arg Leu Leu Leu
530 535 540

Thr Ser Ala Pro Ser Leu Ala Thr Ser Pro Ala Phe Arg Tyr Asp Leu
545 550 555 560

Leu Asp Leu Thr Arg Gln Ala Val Gln Glu Leu Val Ser Leu Tyr Tyr
565 570 575

Glu Glu Ala Arg Ser Ala Tyr Leu Ser Lys Glu Leu Ala Ser Leu Leu
580 585 590

Arg Ala Gly Gly Val Leu Ala Tyr Glu Leu Leu Pro Ala Leu Asp Glu
595 600 605

Val Leu Ala Ser Asp Ser Arg Phe Leu Leu Gly Ser Trp Leu Glu Gln
610 615 620

Ala Arg Ala Ala Ala Val Ser Glu Ala Glu Ala Asp Phe Tyr Glu Gln
625 630 635 640

Asn Ser Arg Tyr Gln Leu Thr Leu Trp Gly Pro Glu Gly Asn Ile Leu
645 650 655

Asp Tyr Ala Asn Lys Gln Leu Ala Gly Leu Val Ala Asn Tyr Tyr Thr
660 665 670

Pro Arg Trp Arg Leu Phe Leu Glu Ala Leu Val Asp Ser Val Ala Gln
675 680 685

Gly Ile Pro Phe Gln Gln His Gln Phe Asp Lys Asn Val Phe Gln Leu
690 695 700

Glu Gln Ala Phe Val Leu Ser Lys Gln Arg Tyr Pro Ser Gln Pro Arg
705 710 715 720

Gly Asp Thr Val Asp Leu Ala Lys Lys Ile Phe Leu Lys Tyr Tyr Pro
725 730 735

Gly Trp Val Ala Gly Ser Trp
740



(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10380 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Homo sapiens

(viii) POSITION IN GENOME:
(A) CHROMOSOME/SEGMENT: Chromosome 17

(ix) FEATURE:
(A) NAME/KEY: exon 1
(B) LOCATION: 990..1372

(ix) FEATURE:
(A) NAME/KEY: exon 2
(B) LOCATION: 2115..2262

(ix) FEATURE:
(A) NAME/KEY: exon 3
(B) LOCATION: 3056..3202

(ix) FEATURE:
(A) NAME/KEY: exon 4
(B) LOCATION: 3387..3472

(ix) FEATURE:
(A) NAME/KEY: exon 5
(B) LOCATION: 5667..5923

(ix) FEATURE:
(A) NAME/KEY: exon 6
(B) LOCATION: 7745..8955

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:



ATAATGAGCA GTGAGGACGA TCAGAGGTCA CCTTCCTGTC TTGGTTTTGG CAGGTTTTGA 60


CCAGTTTCTT TGCTOCATTC TGTTTTATCA GCGQGGTCTT GTOACCTTTT ATCTTGTGCT 120 

GACCTCCTGT CTCATCCT6T GACGAAGGCC TAACCTCCTG GGAATTCAGC CCAGCAGGTC 180 

TCTGCCTCAT TTTACCCAGC CCCTGTTCAA GATGGAGTCG CTCTGGTTGG AAACTTCTGA 240 

CAAAATGACA GCTCCTGTTA TGTTGCTGCT GCTGCC6CCA ATGGACAGCC TTTAACGTGC 300 

CCQCCAGCCC TGCTCCACCG CCOGCCTGOG CTCACATGGC CCCATCCCtC CTCQAACCTC 360 



CTAGCCTGTT AGTTACTCAA ATCTGCAAGC TCTCTGCCTT CTCAGGGCCT TCAATAAATG 420 

CATTTCTTCT GTCTGGAAGG CTCTTCCTTT CCCTCTTCTA GCCAATTCCT ATTCATCCCT 480 

GAGTTTCAGA TTAAAAQTCA CTTCCTTTGG AAACCTTACT TCGCTACTTC GCTACTTACT 540 

GCACTACTTC GCAGCATCAC AACTATGATG 6AAATCCTTA CTTACGTTAA ATATCTGGTT 600 

TCTAGGTCAC CTCCCTGACG GGGACGGTAG GGACCGTCTT CTCGTTCATC AGTAGGGAAG 660

TAGCTATGGC AQTGCCTGAT ACAAAATAAA CTCCAAATGT GTATTTATTA GATGGTTGGA 720 

TGGAAGTTAT TTGCGTGTGA AAGCGCGTTT TACCCGAAGG CGCTCTGTGA GGGCCAGCGG 780 

GTCCCCTTCG GCCCTGGAGC CGGGGTCACA CGCTCCCCAC CGCGTOCGGT CACGAGACGC 640 

CCCCAAGGGA GTATCCTGGT ACCCGGAAGC CGCGACTCCT GGCCCTGAGC CCGGGCTTAG 900 

; CCTTCGGGTC CACGTGGCCG GAGCCGGCAG CTGATTGGAC GCGGGCCGCC CCACCCCCTG 960 

GCC6TCGCGG GACCCGCAGG ACTGAGACCA TG6A6GCGGT GGCGGTGGCC GCGGCGOTGG 1020 

GGGTCCTTCT CCTGGCCGGG GCCG6GGGCG' CGGCAGGCGA CGAGGCCCGG GAGGCGGCGG 1080 

CCGTGC6GGC GCTCGTGGCC CGGCTGCTGG GGCCAGGCCC CGCGGCCGAC TTCTCCQTGT 1140 

CGGTGGAGCO CGCTCTGGCT GCCAAGCCGG GCTTGQACAC CTACAGCCTG GGCGGCGGCG 1200 

GCXSCGGCGCG CGTGCGGGTG CGCGGCTCCA CGGGCGTGGC GGCCXJCCGCG GGGCTGCACC 1260 

GCTACCTGCG CGACTTCTGT GGCTGCCACG TGGCCTGGTC CGGCTCTCAG CTGCGCCTGC 1320 

CGCGGCCACT GCCAGCC6TG CCGGGGGAGC TGACCGAGGC CACGCCCAAC AGQTACCGCC 1380 

CCGAAGCTTC CCCOCGTCCG CCCOAOGCOC TTACCCCCTC CCGOAGCCQC T6CCACCCAA 1440 

ATCGGGAGGC TGAGCGGGGA GCGCTGQCCG GAAGGCCCAQ CTGCGCCGCC TCCAGCAGCT 1500 

GTGTGGCCTT QAGCCAGCCA CTCTGCCTTT CAGAGCCTCG GCTGGCCCAC CTGAAAAACQ 1560 

GAAAGAASAC GCCTACCGTG CAGTGTTATT OTGAGGATTT GCACGATGAT GGGCATA0AA 1620 

TTTGTGGTGC ACAATTGGTG ATGAGTOAAT TTTCTTGCCT TCCTCCCCCA CCTTCTCTTT 1680 

GAACCTGCGG ACTGAGGAAG 6AC6CCTCCA TCCCCCACCC TACAGGCCTG TGTTCCAGC6 1740 

CCTGCXTACAC TATOGAGTGA TOTQTTCACA CAGCTGTCCT CCCCTGCCCA TCTGTTAOAC 1800 

TGTGGGGGCA GGGATTCCCC GTTCCAQGAA AACACCQTGC AGAGOAGGOG CTCTG6CA6T 1860 

GTGGCATGAA AGTGGAATAX GCCACCCAAA TACCCGCCAG GCTAGAGGGC CCTGGGAGAG 1920 

TGCAOGGGAC GAGTGCCTCA QAAGCCCAGC CCCQ6TACCT GGTCTCAGCT CCACCTGGGQ 1980 

TGGGTCCCAG TGTGCAQCAO AAGGGCCGAG TTTGGAGCCC CTCCCCTCTC CTCTAGGTGQ 2040 

GGGATGGGGG ATTTGTTCCA GGGCCGTQGA CCCTCCAGGG TGGQATGCGC CCCTGCTCAT 2100 

GACACTGCCC GCAGGTACCG CTATTACCAG AATGTGTGCA CGCAAAGCTA CTCCTTCGTO 2160 

TG6TGQGACT GGGCCCQCTG GGAGCGAGAG ATAOACTGGA TGGCGCTGAA TGGCATCAAC 2220 

CTGGCACTGG CCTGGAGCGG CCAG0A30CC ATCTQGCAQC GGGTGCGTGC CCACTGTCCC 2280 

TTCCCCACCC TCCTCTATGG CGGGAGCCAC CGTAGGTGTT TTCACCCGCC CCCCAGCATG 2340 

GGCGCAGTGT CTCTCTCTAG AAGTGCTTTC AGCGTGCACA GTGGCTTQGG CCTCCTAAAA 2400 

ACTGAGGCTT CCGQCCQGGC GCGGTGGCTC ACGCCTGTCA TCCCAGCACT TCGGGAGGCC 2460 

TAGGCGGGCG GATCAGGAOT TCAGGAGATC GAGACCATCC TGGCCAACAT TGTGAAACCC 2520 

CGTCTCTACT AAAATACAAA GAAATAGCAA CCTGGGCAAC AGAGCGAGAC TCTGTCTAAA 2580 

AAAAAAAAAA AAAAAAACTG AGGCTTCCAG TTTGAGGAGT GQGGCTCCTT CCCCCATCTC 2640 

CCCTATGCAG CCAATCACCT GGTCCCTTGG ATCCAACTCA TGGGCAGCTC TAGATCTGCC 2700 

TCCCTSQAAG CTTCTGTGCT GCAATOGCTG CTCCAGGCTC TGCTTAAGCT CTTCACACAG 2760 

TTGCCCTGCC CTTCCATCTG GCACTCTTGC TCCATGAAGC CTTCTAAGGC CTTCCTGTTG 2820 

GGGGAAAGCC CCTTTGTGCC CCATCTCCTC ACCCATGCGA CAAA6GCAAC ACAGTGAACT 2880 

CACCTACTCA CAGQTCTCTT TCCTCTQGGC TGTGGQCTCC TTGATGGCAG CGTTCGGATT 2940 

TTGTCTCAGT AGCCCTAGCA CCCAGCACAA AQAAGCAATG AGTOAATGQT TGTTGAATGA 3000 

ATGAATGAAT GAATGAAGAT GAATATATTT CTATGTGTGG GCCCTTCTTC CTCAGGTGTA 3060 

CCTGGCCTTG GGCCTGACCC AGGCAGAGAT CAATGAGTTC TTTACTGGTC CTGCCTTCCT 3120 

GGCCTGGGGO CGAATGGGCA ACCTGCACAC CTGOGATGGC CCCCTGCCCC CCTCCTOGCA 3180 

CATCAAGCAG CTTTACCTGC AGGTAAAAGG ATGGAAAAQG GAAGGGGCAG AATCGGTGAT 3240 

AGATGGTCAT QGGCCCAGGA AGGGTGGTAT TAQGCCGGCC CCAGQGCTCT TAACTGAGGC 3300 

OGGGGGCTGC GTGTATCCTG GGA0ATGAGG GCCTTCTCAT AGGACAQCAG TOGCCATGCT 3360 

CACCACCCTT CCTTCTOTTC CTCCAOCACC GGGTCCTGGA CCAGATGCGC TCCTTCGGCA 3420 

TGACCCCAGT GCTGCCTGCA TTCX3COGGGC ATQTTCCCQA GGCTGTCACC AGGTGAGGTT 3480 

CCQCTCACCC CCTCCACTTA GCTCAGAQAQ OGAATTTTAT TCCCTTCTAG AACATQACTT 3540 

AAAAACTTAA GCTCTGGGCC 6GGCGCAGTG QCTCACGCCT 6TAATCCCAQ CACTTTGQGA 3600 

GGCCGAGTTX3 GGCGGATCAC CTGAGGTCAG GAGTTCGAGA CCAGCCTGGC CAACATGGTG 3660 

AAACCCTQTC TCTACTAAAA ATATAAAAAT TAGCTGGGCA TGGTGGCACG CGCCT6TAAT 3720 

CCCATCTACT TAGOAGGCTG AQACAOGAGA ATTGCTTAAA CCTG6GAGGC AGACGTTGCA 3780 

GTGAGTCAAG ATCACGCCAT TOCACTCCAG CCTGGGTGAC GAGCGAAACT CTGTCTCAAA 3840 

CAAACAAACA AGCTCTGGAC GTAGGCCTQO GTTTGATTTC TGACTCTC5CT ACTAATTAQC 3900 

TGTGTGACTT CGGGCAGATG ACATGACTGC TCTGTGCCTC AGTTTCCTTA CTT6TAAAAT 3960 

GGGATCTCTA CCCACTTCQC TGTAOGGTTT GTAATTATCT CTCGATCTAT CTGTGACTTT 4020 

GCACAGAGTG CTAGCAAATG GCAGCCCTTG GGAGTGGCA6 CAGOGGTGCT CCAGTGTCCC 4080 

TTGTCCCTCC TGTTCCTCTG TGCTTCCCAG CCATCCTCTC ACATGTGGTT GGGAAAAGTC 4140 

TTCAAGGCTC ACCTGAGACC TCCCCTCCTT CAGGAAGCCT TGCTAGTGCC CCGCATGACC 4200 

TCCTTTGCAC CTGCTAATGT CTGGCTCCCA TACTCTCGTA GGACTXAATG CATGCCAGTG 4260 

GCCTCCCTGC CCGCCTCTTT GCCCCCATCA GCAGGTGGCA 6GAAACTCAC TCATTCATTC 4320 

AATAAACTTG GTCCAGCTGT CTGAGGCTGC CAGAACTGGC TGTGCTGGGT CCTGGQAGGC 4380 

GGCAAGAAAG GTGCCCAAGG GCTTACCCCT GATAGGAGAG ATATGTTGGC TGAAGGATAC 4440 

AATGTGGGGA CAAGGACAGG AATATATGTG GGTTCCGCTC TCCTCTGCCG GGAGAGAGGQ 4500 

GCAGGAAGGG CTCAGGGCAG AGCCCAGCCT TGAAAAATQA GTGTTQCTTG GACGQACGCT 4560 

TGGCTAATGC TTGTAATCCT AGCGTTTTGG GAOGCTGAGG CQTATGGATC ACCTGCGGTC 4620 

AGGAGTTAAA GACCAGCXTTG GCCAACATGG C6AAACCCCA TCTCTACTAA AAGTACAAAA 4680 

ATTAGCCAGG CGTGGTGGCG GGCTCCTGTA ATCCCAGCTA CTCGGTAGGC TGAGGCATGA 4740 

GAATCTCTTG AAGCCAGGGG CCAGAGACTG CAGTGAGCCQ AGATCACACC ACTTCACTCC 4800 

AGCCTGGGTG ACA6AGTGAG ACTCCGTCTC AAAAAAAAAA AAAAAAAAA6 GAAAGAAAAT 4860 

TAAACACCTC ATGTTCTCAC TCATAGTGGG AGTTGAACAA TGAGAACAAC ATGGACACAG 4920 

GAAGGGGAAC ATCACACACC GGGGCCTXTC OCGGTOTGGG 6GTCAAGGGG AGGAGTAGCA 4980 

TTGGGACAGA TACTTAATGC ATGCGGGGCT GAAAACCTAG ATGATGGOTT GATGG6T6CA 5040 

GCAAACCACC ATGGCACATG TATACCTATG CAACAAACCT GCATGTTCTG CACAGAACTG 5100 

AACTGAAAGT ATAATTAAAA AAAAAAAAAA AAGCTGGGTG CGGTGGCCCA CACCTGTAAT 5160 

CCCAGCACTT TGGGAGGCCG AOACGGGCOa ATCACAAGGT CAQCAGATCG AGACCATCCT 5220 

GGCTAACACA GTGAAACTCA GTCTCTACTA AAAATACAAA AAATTAGCCG GGTGTGGTGG 5280 

CGGGCACCTG TAGTCCCAGC TACXAGGGAG GCTGAGGCAG GAGAATGGCA TGAACCTGGG 5340 

AGGCAGAGCT TGCAGTGAGC TGAGAATGCQ CCACTQCACT CCAGCCTGGO GGACAGAGTG 5400 

AQACTCTGCC TCAAAAAAAA AAAAAAAAAG AAAGAAAAAG QAJSCGTTGCT TGTTTCAGGC 5460 

CACAGGAAGG GGAOAGATAG TGAAAGrTTT TCAGAGAAGG TGGCCAGGGA AGGAGAAGAA 5520 

AGGACTGTAG GCAGAGAGCA TAGCCTGTAC AAAGCCATAG AGGCAAGAGA AACCA6GAQC 5580 

TGTAOAGAAG TTOGCAAQGC TOTTGAACAC TATQQTOAAC ACTATOGCGG CTTCCATGAA 5640 

ATATCTGAGC TTTTGCTCCC CACTAGGGTG TTCCCTCAGG TCAATGTCAC GAAGATGGGC 5700 

AGTTGGGGCC ACTTTAACTG TTCCTACTCC TGCTCCTTCC TTCTGGCTCC GGAAGACCCC 5760 

ATATTCCCCA TCATCGGGAG CCTCTTCCTG CGAGAGCTGA TCAAAGAGTT TGOCACAGAC 5820 

CACATCTATG GGGCCGACAC TTTCAATOAG ATQCAOCCAC CTTCCTCAGA GCCCTCCTAC 5880 

CTTGCCGCAG CCACCACTGC CGTCTATGAG GCCATGACTG C7W3GTACAGT GCCTGGGTGG 5940 

GGTGGGAGAG CCCCCCAOAC CCTCAAAAAG AAGQGAOTAG CAGATX5TCAG TAGGGGTAGG 6000 

CAGAGGGACT QGAATAATGC CTCGCCATAA CACACAGTAC TTTATAGTTT ACCAAGCACG 6060 

TGTACACATG CX5TTGTCTCA GTGAATCCCA CTGTGGTTGA GAGGTGAGCT CTGGAAGCCA 6120 

ACAACCTGGO TCACACCTCG CGCTCCTATT TCCrTGGCCGT GTGACTTATG ACTCATGACC 6180 

TCCTTCCCAG TGTCTCGTTT OCTTTTCCTG TAAACT6Q0A CTACCTCATA GGTAGAATAA 6240 

CGCCTGGCCC AGAGCAAAGG CCACTAAGAG CTAGCTATGA ACAAGGATTT TGTTTCATCT 6300 

CTGCGTGGTT GCTGAAGTAG GCACTGCAGG CAGGAGQTQA GTGGATGTGC CTAAAGGCAC 6360 

TAAGTGCGCA TCCTGCTACA AAACTGT6AA GCCAQQOCTC CTTCCTGCCA CTTAAAGOAG 6420 

GAGTQOAGCA 6AQGGCQCCC AAOTCAGQAA TGACTTAOTG 0AGA6GCGTC TGTGTTOOCC 6480 

AOGAAGGGAA CAGATCAGCT CAQCCTTTCT TGAGCAQTAC TQCTCCAAGT GTGACCCAAA 6540 

ACCAGCA6CA GCAGCAGCAG CAGCJCCGAGC TGTGAGAXGG CAAATTCTCA GGCCCTACCC 6600 

AAOACCTQAA GGAGAAGCTA CATTTTTTTT TTTTTTQAOA CAQATTTCAC TCTGTTGCTG 6660 

AGGCTGGAGC ACAOTGGCAC AATCTCATCT CACTGCAACC TTCGTCTCCT AGGTTCAAGC 6720 

OATTCTCCTG CCTCAGCCTC CCQAQTAGCT GGGACTATAG GCACCCOCCA CCACQCCCGG 6780 

CAATTTTTQT TTGTTTT6AG ATAGAGTCTC GCTCTGTCAC CCAGGCTGGA 6TGCAGTGGC 6840 

ACGATCTCAfi TTCACTGCAA CCTCTGCTTC CTGAGTTCAA GCGATTCTCC TGCCTCAGCC 6900 

TCCTGAGTAG CTOGGATTAC AGGCGCCCCC CAACCACACT CGGCTAATTT TTGTATTTTT 6960 

AGTAOAOACQ GGGTTTCGCT ATQTAGOTCA AGCTGQTTTC AAACTCCTGA CCTCAAATGA 7020 

TTCGCCCACT TCAGCCTCCC AAAGTGCTOG GATTACAGGT GTGAGCCACC TTGCCTGGCC 7080 

AATTTTTGTA TTTTTAGTAG AAACAGGTTT CACCATGGTG GCCAGACTGG TCTCAAACTC 7140 
CTGACCTCAG GTGAACTGCC CACCTCAGCC TCCCAAAGTA CTGGTATTAC AGGCGTOATC 7200 
CACTGCQACT GGCCTTGATT TTGTTTTTGA GACAGAATCT TACTCTGTCG CCCAGACTGG 7260 
AGTGCAGTGG CACAATCTCA GCTCACTGCA ACTTCTGCCT CATGGGTTCA AGTGATTCTT 7320 
QTGCCTCTAC CTCCCGAGTA GCCGGOATTA CAGGCACCTQ CCATTACGCT AGGCTAATTT 7380 
TTGTATTTTT AGTATAGACA GGGTTTCCCC ACATTGGCCA GGCTGGTCTG 6AACTCCTGG 7440 
GCTCAAGTGA TCCACCTGCT TCAGCCCCTC A6AGTACTGG GATTATAGGT GTGGQCCACC 7500 
ACGCCCATTC AGAAACCTCC ATGTTTTAAG OAGCCCTCTO GGTAACTCTC ATGTTCACCC 7560 

AAGCTGCTGA ACCCTGTCCT GQAGTTTTCA OAGGGACGCG TATGTGCCAC AGAGCGTCCC 7620 

GCTGGTGGGG GTCATGGGAA GCCAT6ACCT GGGATAGACA GTCGTCTGTA GAGTGGOQTG 7680 

AACATTCCCT GGGCCCTCTG TTTCATCACT CCTCTTCTCT GTTCCCCCTA CCTCCTGTCC 7740 

ACAGTGGATA CTGAOGCTGT GTGGCTGCTC CAAGOCTGGC TCTTCCA6CA CCAGCCGCAG 7800 

TTCTGGGGGC CCGCCCAQAT CAGGGCT6T6 CT6GGAGCTG TGCCCCQTGG CCOCCTCCTG 7860 

GTTCTGGACC TGTTTGCTGA GAGCCAGCCT GTGTATACCC GCACTGCCTC CTTCCAGGGC 7920 

CAGCCCTTCA TCTGGTGCAT GCTGCACAAC TTTGGGGGAA ACCATGGTCT TTTTGQAOCC 7980 

CTAGAGQCTG TGAACGGAGG CCCAGAAOCT GCCCGCCTCT TCCCCAACTC CACCATGGTA 8040 

GGCACGGGCA TGGCCCCCGA GGGCATCAGC CAGAACGAAG TGGTCTATTC CCTCATGGCT 8100 

GAGCTGGGCT GGCGAAAGOA CCCAGTGCCA GATTTGGCAG CCTGGGTQAC CAGCTTTGCC 8160 

GCCCQOCGOT ATGGGGTCTC CCACCOGGAC GCAGGGOCAG CGTGQAOQCT ACT6CTCCGG 8220 

AGTGTQTACA ACTGCTCCGO QQAGGCCTQC AGGGGCXTACA ATCGTAGCCC GCTGQTCAGG 8280 

CGGCCGTCCC TACAGATGAA TACCAGCATC TGGTACAACC GATCTOATGT GTTTGAGGGC 8340 

TGGCGGCTGC TGCTCACATC TGCTCCCTCC CTOGCCACCA GCCCCGCCTT CCGCTACOAC 8400 

CTGCTGGACC TCACTCX5GCA GGCAOTGCAQ QAOCTOGTCA GCTTGTACTA TGAGGAGGCA 8460 

AGAAGCGCCT ACCTGAGCAA GGAaCTGGCC TCCCTGTTQA GOQCTOGAGG CGTCCTG6CC 8520 

TATGAGCTGC TGCCGGCACT GGACOAGGTG CTGGCTAGTG ACAOCCGCTT CTTGCTGGOC 8580 

AOCTGGCTAG AGCAGGCCCG AQCAGCGOCA OTCAGTGAQO CCGAGGCCGA TTTCTACGAG 8640 

CAGAACAGCC GCTACCAGCT GACCTTOTOG GGGCCAGAAG GCAACATCCT GGACTATGCC 8700 

AACAAGCAGC TQGCGGQGTT OGTGGCCAAC TACTACACCC CTCGCTGQCO GCTTTTCCTG 8760 

OAGOCGCTGO TTGACAGTQT GGCCCAGGGC ATCCCTTTCC AACAGCACCA GTTTGACAAA 8820 

AATGTCTTCC AACTGGAOCA GGCCTTCGTT CTCAGCAAGC AGAGGTACCC CAQCCAGCCG 8880 

CGAGGAGACA CTGTGGACCT GGCCAAOAAG ATCTTCCTCA AATATTACCC CGGCTGGGTG 8940 

GCCGGCTCTT OGTGATAGAT TCGCCACCAC TGGGCCTTGT TTTCCGCTAA TTCCAGGGCA 9000 

GATTCCAGGG CCCAGAGCTG GACAGACATC ACAGGATAAC CCAGGCCTGQ GAGGAGOCCC 9060 

CACGGCCTGC TGGTGGGGTC TGACCTGGGG GGATTGGAGG GAAATGACCT GCCCTCC7VCC 9120 

ACC/ICCCAAA 6TGTGGGATT AAAGTACTGT TTTCTTTCCA CTTAAACTGA TGAGTCCCCT 9180 

GGGTCTQTCA AAATGAGAAG GTCACTGCTG CCACGCTTGG GAOGACTCAG GGCTATAGCA 9240 

TGGCCCTGGG GTGGGACCTG TTCTCCCATC CCTTGCCTCA CGTCCCfOTT TTTGTTTGTT 9300 

TGTTTGTTTG TGACGGAGCC TTGQTCTGTT GCCCAGGCTT GAGTACAATG GCACAGTCTC 9360 

GGCTCACTGC AACCTCCGCC TCXrTGGGTTC AAGCAATTCT TOTaCCTCAG CCTCCCCGGT 9420 

AGCTGGGACT ATAGGCATGC ACCACCACAC CAGGCTAATT TTTTTTTTTC CAAGATGGAG 9480 

TCTTGCTCTG TCGCCCAGGT TGQAGTTTAG TGGCACCATA TTGGTTTACT GCAACCTCTG 9540 

CCTCCCGGGT TCAAGCAATT CTCCTGCCTC AGTCTACCAQ OaAGTTAGGA CTACGGGCCT 9600 

GTGCCATCAC GCCTGGCTAA TTTTTOTATT TTTCATAGAG ATAAGGTTTC ACCATGTTGG 9660 

CCAGGCTGGT CTTTAACTCC TOAACTCAAO TGATCCACCT GCCTCQGCCT TCCAAAGTGC 9720 

TGGGATTACA GGAGTGA6CC ACCGTGCCCQ GCCATQTCTC TCTTTTTAAC ACTAATGTTA 9780 

CCCTGACCTT TGAACGTAGA ATGCCCTTCT GTTGCAGGAA AACCTCTTTT CAAACCATGT 9840 

TTGTCCTTTG CTGGCATGCC ACAQCAACAG TCACCAACAC AGAAGACTTC TGTGACCAAA 9900 

TATTTGGAGO ATTTTCCCCA CACACACCAA GCAGCAGACA TCAGCTOGGT OTCCTCCAAT 9960 

TCAGTTCCAA TGTAATCAAC CAGAGACAQC ATCAGATCCC ACAGGGTTAG GGTGCAQATC 10020 

CATGAGACCA CCCCCTCCTT CCCAACGQTT ACAAGTCCTG ATCCCTGGAA CTTCTGACTA 10080 

ACTG6CTTCA AGTTGQAGTT CCCATOACCC CCTTCCCCTC TTTGGAGTCA ACTCATTTOC 10140 

GACAQTOACC CACGAAACAC AGGGAAACCC TTATtATQTT TATTGCTTTA TTACAQAGGA 10200 

AAAAAATTTT TTTCTTTCTT TTTTGAGACA GGGTCTCACT CTQTCATCXIA 6AATGACTGC 10260 

AQTGGCAGGA TCTGGCTCCX3 TCACCCAGQC TGGAOTGCAG TGGCAT6ATC TCX^GCTCACT 10320 

ACAGCCTCCA TCCCCCCCAA ACCCCACGCC TCAGCGCCCC ACCCCGCAAG TGGCTQGGAC 10380 


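The nucleotide listing above is laid out in blocks of ten bases, with each line closed by the cumulative number of bases presented so far. A minimal Python sketch of a consistency check on that layout follows. It is not part of the specification: the names `LINE_RE`, `parse_listing_line` and `check_counts` are hypothetical, and the sketch assumes that every sequence line contains only letters in its base blocks, so lines carrying stray digits or punctuation are skipped rather than judged.

```python
import re

# One listing line: whitespace-separated blocks of letters followed by a
# running base count. Assumption: base blocks contain letters only.
LINE_RE = re.compile(r"^\s*((?:[A-Za-z]+\s+)+)(\d+)\s*$")


def parse_listing_line(line: str):
    """Split a listing line into (bases, running_count), or None if no match."""
    match = LINE_RE.match(line)
    if match is None:
        return None
    bases = "".join(match.group(1).split())
    return bases, int(match.group(2))


def check_counts(lines):
    """Return (line_index, expected, found) for lines whose count is off."""
    problems = []
    running = 0
    for index, line in enumerate(lines):
        parsed = parse_listing_line(line)
        if parsed is None:
            continue  # blank or unparseable line: skipped, not judged
        bases, count = parsed
        running += len(bases)
        if count != running:
            problems.append((index, running, count))
    return problems
```

Because the expected count is accumulated from the bases actually read, a single skipped line will cause every later line to be reported; the check is therefore only meaningful on a listing whose sequence lines all parse.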

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: N-terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Asp Glu Ala Arg Glu Ala Ala Ala Val Arg Ala Leu Val Ala Arg Leu 

1 5 10 15 

Leu Gly Pro Gly 
20 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: N-terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(ix) FEATURE: 

(A) NAME/KEY: Modified-site, glycosylated or phosphorylated, 

wherein Xaa may be any amino acid residue, preferably Arg. 

(B) LOCATION: 16 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Lys Pro Gly Leu Asp Thr Tyr Ser Leu Gly Gly Gly Gly Ala Ala Xaa Val 
1 5 10 15 

Arg 
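
The residue numbering and the stated position of the unidentified residue in SEQ ID NO:5 can be checked mechanically. The Python sketch below is illustrative only: the helper names `parse_peptide` and `xaa_positions` are hypothetical, and the sketch assumes that residues are written as whitespace-separated three-letter codes and that positions are counted from 1 at the N-terminus.

```python
# Three-letter residue codes as written in the sequence listing.
# "Xaa" marks a residue that was not identified.


def parse_peptide(text: str) -> list:
    """Split a whitespace-separated three-letter peptide into residues."""
    return text.split()


def xaa_positions(residues: list) -> list:
    """Return the 1-based positions of residues written as Xaa."""
    return [i for i, res in enumerate(residues, start=1) if res == "Xaa"]


# Residues copied from the SEQUENCE DESCRIPTION of SEQ ID NO:5.
SEQ_ID_NO_5 = (
    "Lys Pro Gly Leu Asp Thr Tyr Ser Leu Gly Gly Gly Gly Ala Ala Xaa Val Arg"
)

if __name__ == "__main__":
    residues = parse_peptide(SEQ_ID_NO_5)
    print(len(residues), xaa_positions(residues))
```

Run on the residues as listed, the sketch counts 18 residues with Xaa at position 16, in agreement with the LENGTH and LOCATION entries for this sequence.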



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(ix) FEATURE: 

(A) NAME/KEY: Modified-site, glycosylated or phosphorylated, 

wherein Xaa may be any amino acid residue, preferably Ala 

(B) LOCATION: 12 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site, glycosylated or phosphorylated, 

wherein Xaa may be any amino acid residue, preferably Ser 

(B) LOCATION: 14 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

Trp Arg Leu Leu Leu Thr Ser Ala Pro Ser Leu Xaa Thr Xaa Pro 
1 5 10 15 

CLAIMS: 

1. An isolated nucleic acid molecule comprising a sequence of nucleotides which encodes 
or is complementary to a sequence which encodes a mammalian a-N- 
acetylglucosaminidase or fragment or derivative thereof. 

2. The isolated nucleic acid molecule according to claim 1 wherein the nucleotides are 
deoxyribonucleotides. 


3. The isolated nucleic acid molecule according to claim 2 wherein said molecule is 
cDNA. 

4. The isolated nucleic acid molecule according to claim 2 wherein said molecule is a 
genomic DNA molecule. 

5. The isolated nucleic acid according to any one of claims 1 to 4 wherein the mammal 
is a human. 

6. The isolated nucleic acid according to claim 5 wherein the a-N-acetylglucosaminidase 
is of liver, kidney or placenta origin. 

7. The isolated nucleic acid molecule according to any one of claims 1 to 6 having a 
nucleotide sequence substantially as set forth in SEQ ID NO:1 or complementary thereto 
or having at least 40% similarity to all or part thereof. 

8. The isolated nucleic acid molecule according to any one of claims 1 to 6 having a 
nucleotide sequence substantially as set forth in SEQ ID No:3 or complementary thereto 
or having at least 40% similarity to all or part thereof. 

9. The isolated nucleic acid molecule according to claim 7 or 8 wherein the percentage 
similarity is at least 60%. 

10. The isolated nucleic acid molecule according to claim 9 wherein the percentage 
homology is at least 80%. 
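
The percentage thresholds recited in the preceding claims (40%, 60% and 80%) presuppose some way of scoring two sequences against each other. The claims themselves do not fix that method, so the Python sketch below adopts the simplest possible measure purely for illustration: percent identity over two sequences that are assumed to be already aligned to equal length. The function names `percent_identity` and `meets_threshold` are hypothetical, and a practical comparison would normally rely on an alignment program rather than this position-by-position count.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of positions carrying the same symbol in two aligned sequences.

    Assumption: both sequences are already aligned and of equal length;
    gap characters ("-") are counted as mismatches.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to equal length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(
        1 for a, b in zip(seq_a.upper(), seq_b.upper()) if a == b and a != "-"
    )
    return 100.0 * matches / len(seq_a)


def meets_threshold(seq_a: str, seq_b: str, threshold: float) -> bool:
    """True when percent identity is at least the given threshold."""
    return percent_identity(seq_a, seq_b) >= threshold
```

For example, two aligned ten-base sequences differing at one position score 90% and would satisfy each of the 40%, 60% and 80% thresholds under this measure.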



11. The isolated nucleic acid molecule according to any one of claims 1 to 10 wherein 
the a-N-acetylglucosaminidase or fragment or derivative thereof encoded by said molecule 
comprises an amino acid sequence substantially identical to SEQ ID No:2 or is at least 40% 
similar to all or a part thereof. 

12. The isolated nucleic acid molecule according to claim 11 wherein the percentage 
similarity to SEQ ID No:2 is at least 60%. 

13. The isolated nucleic acid molecule according to claim 12 wherein the percentage 
similarity to SEQ ID No:2 is at least 80%. 

14. The isolated nucleic acid molecule according to any one of claims 1 to 13 wherein 
said molecule is carried by a vector capable of replication in a eukaryotic cell and/or a 
prokaryotic cell. 

15. The isolated nucleic acid molecule according to claim 14 wherein the vector is an 
expression vector. 



16. The isolated nucleic acid molecule according to claim 15 wherein the expression 
vector is capable of being expressed in cells derived from a eukaryote. 

17. The isolated nucleic acid molecule according to claim 16 wherein the expression 
vector is further capable of being expressed in cells derived from a mammal. 

18. The isolated nucleic acid molecule according to claim 17 wherein the expression 
vector is further capable of being expressed in CHO cells. 

19. A recombinant mammalian a-N-acetylglucosaminidase or fragment or derivative 
thereof. 

20. The recombinant mammalian a-N-acetylglucosaminidase according to claim 19 
in substantially pure form. 

21. The recombinant mammalian a-N-acetylglucosaminidase according to claims 19 
or 20 when expressed in mammalian, yeast or insect cells. 

22. The recombinant mammalian a-N-acetylglucosaminidase according to claim 21 
when expressed in mammalian cells. 

23. The recombinant mammalian a-N-acetylglucosaminidase according to claims 21 
or 22, wherein the cells are capable of glycosylating said recombinant mammalian a-N- 
acetylglucosaminidase. 

24. The recombinant mammalian a-N-acetylglucosaminidase according to claim 23 
wherein the cells are capable of N-glycosylating said recombinant mammalian a-N- 
acetylglucosaminidase. 

25. The recombinant mammalian a-N-acetylglucosaminidase according to claim 24 
wherein the cells are CHO cells. 

26. The recombinant mammalian a-N-acetylglucosaminidase according to any one of 
claims 19 to 25 wherein said recombinant a-N-acetylglucosaminidase is in a glycosylated 
form. 

27. The recombinant mammalian a-N-acetylglucosaminidase according to claim 26 
wherein the molecular weight of the glycosylated form as determined using SDS/PAGE 
is at least approximately 79 kDa. 

28. The recombinant a-N-acetylglucosaminidase wherein the molecular weight of the 
glycosylated form as determined using SDS/PAGE is at least approximately 79 kDa to 89 
kDa. 



29. The recombinant mammalian a-N-acetylglucosaminidase according to any one of 
claims 19 to 28 comprising a sequence of amino acids substantially the same as a human 
a-N-acetylglucosaminidase. 



30. The recombinant mammalian a-N-acetylglucosaminidase according to any one of 
claims 19 to 29 when fused to another proteinaceous molecule. 

31. The recombinant mammalian a-N-acetylglucosaminidase according to claim 30 
wherein the other proteinaceous molecule is an enzyme, reporter molecule, purification site 
and/or a signal sequence. 

32. The recombinant mammalian a-N-acetylglucosaminidase according to any one of 
claims 19 to 31 comprising an amino acid sequence substantially as set forth in SEQ ID 
NO:2 or having at least 40% similarity to all or part thereof. 

33. The recombinant mammalian a-N-acetylglucosaminidase according to claim 32 
wherein the percentage similarity to SEQ ID NO:2 is at least 60%. 



34. The recombinant mammalian a-N-acetylglucosaminidase according to claim 33 
wherein the percentage similarity to SEQ ID NO:2 is at least 80%. 

35. A recombinant a-N-acetylglucosaminidase when produced by expression of a 
nucleic acid molecule according to any one of claims 14 to 18. 

36. The recombinant a-N-acetylglucosaminidase according to claim 35 when 
glycosylated. 

37. A method of diagnosing a mutation in a gene which encodes a-N- 
acetylglucosaminidase in a human patient said method comprising contacting genomic 
DNA or RNA derived from said patient with one or more isolated DNA molecules or 
oligonucleotides comprising at least 10 contiguous nucleotides derived from SEQ ID No: 1 
or SEQ ID No: 3 or a complementary strand thereof for a time and under conditions 
sufficient for hybridisation to occur and then detecting said hybridisation using a detection 
means. 
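
The probes recited in the preceding claim are stretches of at least 10 contiguous nucleotides taken from a reference sequence or from its complementary strand. A minimal Python sketch of how such candidate stretches could be enumerated is given below. It is illustrative only: the names `reverse_complement` and `contiguous_probes` are hypothetical, the input is assumed to contain only the unambiguous bases A, C, G and T, and no account is taken of melting temperature, specificity or any other criterion that would govern the choice of a probe in practice.

```python
# Mapping of each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the complementary strand, read 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def contiguous_probes(seq: str, length: int = 10):
    """Yield (start, probe) for every window of `length` contiguous bases.

    `start` is the 1-based position of the first base of the window.
    """
    if length < 1:
        raise ValueError("length must be positive")
    seq = seq.upper()
    for offset in range(len(seq) - length + 1):
        yield offset + 1, seq[offset:offset + length]


if __name__ == "__main__":
    toy = "ACGTACGTACGT"  # toy input, not taken from the sequence listing
    for start, probe in contiguous_probes(toy, 10):
        print(start, probe, reverse_complement(probe))
```

The same function can be called with a length of 20, 50 or 100 to enumerate longer candidate stretches.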



38. The method according to claim 37 wherein the detection means is a reporter 
molecule covalently attached to the isolated DNA molecule or oligonucleotide. 

39. The method according to claim 38 wherein the reporter molecule is ³²P, ³⁵S or 
biotin. 



40. The method according to claim 37 wherein the detection means is a polymerase 
chain reaction format. 

41. The method according to claim 40 wherein the polymerase chain reaction format 
is selected from the list comprising SSCP, AMD, AFLP, IRS-PCR, iPCR or RT-PCR, 
amongst others. 

42. The method according to claim 41 wherein the polymerase chain reaction format 
is SSCP. 

43. The method according to any one of claims 37 to 42 wherein the isolated DNA 
molecule or oligonucleotide comprises at least 20 contiguous nucleotides derived from 
SEQ ID No:1 or SEQ ID No:3 or a complementary strand thereof. 

44. The method according to claim 43 wherein the isolated DNA molecule or 
oligonucleotide comprises at least 50 contiguous nucleotides derived from SEQ ID No:1 
or SEQ ID No:3 or a complementary strand thereof. 



WO 97/19177 PCT/AU96/00747 


45. The method according to claim 44 wherein the isolated DNA molecule or 
oligonucleotide comprises at least 100 contiguous nucleotides derived from SEQ ID No:1 
or SEQ ID No:3 or a complementary strand thereof. 



46. A method for treating a patient suffering from a-N-acetylglucosaminidase 
deficiency said method comprising administering to said patient an effective amount of 
recombinant mammalian a-N-acetylglucosaminidase or an active fragment or derivative 
thereof. 



47. The method according to claim 46 wherein the mammalian a-N- 
acetylglucosaminidase comprises a sequence of amino acids substantially the same as the 
amino acid sequence of human a-N-acetylglucosaminidase. 

48. The method according to claim 47 wherein the patient is suffering from 
mucopolysaccharidosis type IIIB. 

49. The method according to any one of claims 46 to 48 wherein the recombinant a-N- 
acetylglucosaminidase is produced in mammalian cells. 

50. The method according to claim 49 wherein the mammalian cells are capable of 
glycosylating the recombinant a-N-acetylglucosaminidase produced therein. 

51. The method according to claim 50 wherein the recombinant a-N- 
acetylglucosaminidase is in a glycosylated form. 

52. The method according to claim 51 wherein the glycosylated form of the 
recombinant a-N-acetylglucosaminidase has a molecular weight as determined using 
SDS/PAGE of at least approximately 79kDa. 

53. The method according to claim 52 wherein the glycosylated form of the 
recombinant a-N-acetylglucosaminidase has a molecular weight as determined using 
SDS/PAGE of approximately 79 kDa to 89 kDa. 

54. The method according to any one of claims 46 to 53 wherein the recombinant a-N- 
acetylglucosaminidase comprises a sequence of amino acids substantially as set forth in 
SEQ ID No:2 or having at least 40% similarity to all or a part thereof. 

55. The method according to claim 54 wherein the percentage similarity to SEQ ID 
NO:2 is at least 60%. 

56. The method according to claim 55 wherein the percentage similarity to SEQ ID 
NO:2 is at least 80%. 

57. A method for treating a patient suffering from a-N-acetylglucosaminidase 
deficiency said method comprising administering to said patient an effective amount of 
recombinant mammalian a-N-acetylglucosaminidase or an active fragment or derivative 
thereof, wherein said recombinant mammalian a-N-acetylglucosaminidase is produced by 
expression of a nucleic acid molecule according to any one of claims 14 to 18. 

58. The method according to any one of claims 46 to 57 wherein administration of the 
recombinant mammalian a-N-acetylglucosaminidase is by oral, intravenous, suppository, 
intraperitoneal, intramuscular, intranasal, intradermal or subcutaneous administration by 
infusion or implantation or by enzyme replacement therapy or by gene therapy. 

59. The method according to claim 58 wherein the method of administration is by 
enzyme replacement therapy. 

60. A pharmaceutical composition comprising a recombinant mammalian a-N- 
acetylglucosaminidase or an active fragment or derivative thereof and one or more 
pharmaceutically acceptable carriers and/or diluents. 

61. The pharmaceutical composition according to claim 60 wherein the recombinant 
mammalian a-N-acetylglucosaminidase comprises a sequence of amino acids substantially 
the same as human a-N-acetylglucosaminidase. 

62. The pharmaceutical composition according to claims 60 or 61 wherein the 
recombinant mammalian a-N-acetylglucosaminidase is produced in a mammalian cell. 

63. The pharmaceutical composition according to claim 62 wherein the mammalian cell 
is a CHO cell line which is capable of glycosylating the recombinant mammalian a-N- 
acetylglucosaminidase. 

64. The pharmaceutical composition according to any one of claims 60 to 63 wherein 
the a-N-acetylglucosaminidase is glycosylated. 

65. The pharmaceutical composition according to claim 64 wherein the recombinant 
a-N-acetylglucosaminidase has a molecular weight as determined using SDS/PAGE of at 
least approximately 79 kDa. 

66. The pharmaceutical composition according to claim 65 wherein the recombinant 
a-N-acetylglucosaminidase has a molecular weight as determined using SDS/PAGE of 
approximately 79kDa to 89 kDa. 

67. The pharmaceutical composition according to any one of claims 60 to 66 wherein 
the recombinant a-N-acetylglucosaminidase comprises a sequence of amino acids 
substantially as set forth in SEQ ID No:2 or having at least 40% similarity to all or part 
thereof. 

68. The pharmaceutical composition according to claim 67 wherein the percentage 
similarity to SEQ ID NO:2 is at least 60%. 



69. The pharmaceutical composition according to claim 68 wherein the percentage 
similarity to SEQ ID NO:2 is at least 80%. 

70. A pharmaceutical composition comprising recombinant mammalian a-N- 
acetylglucosaminidase or an active fragment or derivative thereof and one or more 
pharmaceutically acceptable carriers and/or diluents wherein said recombinant mammalian 
a-N-acetylglucosaminidase is produced by expression of a nucleic acid molecule according 
to any one of claims 14 to 18. 

71. A pharmaceutical composition comprising recombinant mammalian a-N- 
acetylglucosaminidase or an active fragment or derivative thereof and one or more 
pharmaceutically acceptable carriers and/or diluents when used in the method according 
to any one of claims 46 to 59. 

72. Use of recombinant mammalian a-N-acetylglucosaminidase or an active fragment 
or derivative thereof in the manufacture of a medicament for the treatment of a-N- 
acetylglucosaminidase deficiency in a patient. 

73. The use according to claim 72 wherein the recombinant mammalian a-N- 
acetylglucosaminidase comprises a sequence of amino acids substantially the same as the 
amino acid sequence of human a-N-acetylglucosaminidase. 

74. The use according to claims 72 or 73 wherein the patient is suffering from 
mucopolysaccharidosis type IIIB. 

75. The use according to claim 74 wherein the recombinant a-N-acetylglucosaminidase 
is expressed in mammalian cells. 

76. The use according to claim 61 wherein the cells are CHO cells. 



77. The use according to any one of claims 72 to 76 wherein the a-N- 
acetylglucosaminidase is glycosylated. 

78. The use according to claim 77 wherein the recombinant a-N- 
acetylglucosaminidase has a molecular weight as determined using SDS/PAGE of at least 
approximately 79 kDa. 

79. The use according to claim 78 wherein the recombinant a-N- 
acetylglucosaminidase has a molecular weight as determined using SDS/PAGE of 
approximately 79 kDa to 89 kDa. 

80. The use according to any one of claims 72 to 79 wherein the recombinant a-N- 
acetylglucosaminidase comprises a sequence of amino acids substantially as set forth in 
SEQ ID No:2 or has at least 40% similarity to all or a part thereof. 

81. The use according to claim 80 wherein the percentage similarity to SEQ ID NO:2 
is at least 60%. 

82. The use according to claim 80 wherein the percentage similarity to SEQ ID NO:2 
is at least 80%. 

83. A nucleic acid molecule comprising a sequence of nucleotides encoding or 
complementary to a sequence encoding a polypeptide capable of hydrolysing the terminal 
a-N-acetylglucosaminidase residues present at the non-reducing terminus of fragments of 
heparan sulphate and heparin and wherein said nucleotide sequence is capable of 
hybridising under at least low stringency conditions to the nucleotide sequence set forth 
in SEQ ID NO:1. 

84. A nucleic acid molecule comprising a sequence of nucleotides encoding or 
complementary to a sequence encoding a polypeptide capable of hydrolysing the terminal 
a-N-acetylglucosaminidase residues present at the non-reducing terminus of fragments of 
heparan sulphate and heparin and wherein said nucleotide sequence is capable of 
hybridising under at least low stringency conditions to the nucleotide sequence set forth 
in SEQ ID NO: 3. 

85. A recombinant polypeptide comprising a sequence of amino acids corresponding 
to the amino acid sequence set forth in SEQ ID NO: 2 or having at least 40% similarity thereto 
and encoded by a nucleic acid molecule which is capable of hybridising to the nucleotide 
sequence set forth in SEQ ID NO: 1 or SEQ ID NO:3 under at least low stringency 
conditions. 

86. A genetic construct comprising the nucleic acid molecule according to any one of 
claims 1 to 18 or claims 83 to 84, operably connected in the sense orientation to a promoter 
sequence such that said genetic construct is capable of being expressed in a eukaryotic or 
prokaryotic cell to produce a recombinant mammalian a-N-acetylglucosaminidase or a 
fragment or derivative thereof. 

87. The genetic construct according to claim 86 wherein the promoter is capable of 
regulating expression of the recombinant a-N-acetylglucosaminidase in a mammalian 
cell. 

88. The genetic construct according to claim 87 wherein the promoter is the CMV 
promoter sequence or a promoter derived therefrom. 

89. The genetic construct according to any one of claims 86 to 88 further comprising 
a transcription terminator sequence. 

90. The genetic construct according to any one of claims 86 to 89 when used to express 
or over-express a-N-acetylglucosaminidase in a eukaryotic or prokaryotic cell. 

91. An antibody to a-N-acetylglucosaminidase or a recombinant a-N- 
acetylglucosaminidase according to any one of claims 19 to 36 or an antigenic fragment 
thereof. 

92. The antibody according to claim 91 further defined as a polyclonal antibody 
molecule. 

93. The antibody according to claim 92 further defined as a monoclonal antibody 
molecule. 